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NOVEL MOLECULES OF THE TANGO- 77 RELATED PROTEIN 
FAMILY AND USES THEREOF 

Background of the Invention 
The polypeptide cytokine interleukin-1 (IL-1) 
5 is a critical mediator of inflammatory and overall immune 
response. To date, three members of the IL-1 family, 
IL-la, IL-ljS and IL-lra (Interleukin-1 receptor 
antagonist) have been isolated and cloned, IL-lof and 
IL-ljS are proinflammatory cytokines which elicit 

10 biological responses, whereas IL-lra is an antagonist of 
IL-la and IL-ljS activity. Two distinct cell-surface 
receptors have been identified for these ligands, the 
type 1 IL-1 receptor (IL-lRtI) and type II IL-1 receptor 
(IL-lRtll) . Recent results suggest that the IL-lRtI is 

is the receptor responsible for transducing a signal and 
producing biological effects. 

As mentioned above, IL-1 is a key mediator of the 
host inflammatory response. While inflammation is an 
important homeostatic mechanism, aberrant inflammation 

20 has the potential for inducing damage to the host. 

Elevated IL-1 levels are known to be associated with a 
number of diseases particularly autoimmune diseases and 
inflammatory disorders. 

Since II- Ira is a naturally occurring inhibitor of 

25 IL-1, IL-lra can be used to limit the aberrant and 

potentially deleterious effects of IL-1. In experimental 
animals, pretreatment with IL-lra has been shown to 
prevent death resulting from lipopolysaccharide- induced 
sepsis. The relative absence of IL-lra has also been 

30 suggested to play a role in human inflammatory bowel 
disease . 


Summary of the Invention 
The present invention is based, at least in part, 
on the discovery of a gene encoding Tango-77, a secreted 
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protein that is predicted to be a member of the cytokine 
superfamily. The Tango- 77 cDNA described below (SEQ ID 
NO:l) has three possible open reading frames. The first 
potential open reading frame encompasses 534 nucleotides 

5 extending from nucleotide 356 to nucleotide 889 of SEQ ID 
NO:l (SEQ ID NO : 3 ) and encodes a 178 amino acid protein 
(SEQ ID NO: 2) . This protein may include a predicted 
signal sequence of about 63 amino acids (from about amino 
acid 1 to about amino acid 63 of SEQ ID NO: 2 (SEQ ID 

10 NO: 4) and a predicted mature protein of about 115 amino 
acids (from about amino acid 64 to amino acid 178 of SEQ 
ID NO: 2 (SEQ ID NO: 5) ) . 

The second potential open reading frame 
encompasses 498 nucleotides extending from nucleotide 389 

is to nucleotide 889 of SEQ ID NO:l (SEQ ID NO:6) and 
encodes a 167 amino acid protein (SEQ ID NO: 7) . This 
protein may include a predicted signal sequence of about 
52 amino acids (from about amino acid 1 to about amino 
acid 52 of SEQ ID N0:7 (SEQ ID NO:8)) and a predicted 

20 mature protein of about 115 amino acids (from about amino 
acid 52 to amino acid 167 of SEQ ID NO:7 (SEQ ID NO:9)) . 

The third potential open reading frame encompasses 
408 nucleotides extending from nucleotide 481 to 
nucleotide 889 of SEQ ID N0:1 (SEQ ID NO: 10) and encodes 

25 a 13 6 amino acid protein (SEQ ID NO: 11) . This protein 
includes a predicted signal sequence of about 21 amino 
acids (from about amino acid 1 to about amino acid 21 of 
SEQ ID NO:ll (SEQ ID NO:12)) and a predicted mature 
protein of about 115 amino acids (from about amino acid 

30 22 to amino acid 136 of SEQ ID NO:ll (SEQ ID NO:13)). 

As used herein, the terms ,, Tango-77" , "Tango-77 
protein", "Tango-77 polypeptide" amd the like, can refer 
and polypeptide produced by the cDNA of SEQ ID N0:1 
including any and all of the Tango-77 gene products 
35 described above. 
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Tango- 77 is expected to inhibit inflammation and 
play a functional role similar to that of secreted 
IL-lra. For example, it is expected that Tango- 77 may 
bind to the IL-1 receptor, thus blocking receptor 
5 activation by inhibiting the binding of IL-lar and IL-1/3 
to the receptor. Alternatively, Tango- 77 may inhibit 
inflammation through another pathway, for example, by 
binding to a novel receptor. Accordingly, Tango- 77 may 
be useful as a modulating agent in regulating a variety 
10 of cellular processes including acute and chronic 

inflammation, e.g., asthma, chronic myelogenous leukemia, 
rheumatoid arthritis, psoriasis and inflammatory bowel 
disease . 

In one aspect, the invention provides isolated 

is nucleic acid molecules encoding Tango- 77 or biologically 
active portions thereof, as well as nucleic acid 
fragments suitable as primers or hybridization probes for 
the detection of Tango-77. 

The invention encompasses methods of diagnosing 

20 and treating patients who are suffering from a disorder 
associated with an abnormal level (undesirably high or 
undesirably low) of inflammation, abnormal activity of 
the IL-1 receptor complex, or abnormal activity of IL-1, 
by administering a compound that modulates the expression 

25 of Tango-77 (at the DNA, mRNA or protein level, e.g., by 
altering mRNA splicing) or by altering the activity of 
Tango-77. Examples of such compounds include small 
molecules, antisense nucleic acid molecules, ribozymes, 
and polypeptides, 

30 The invention features a nucleic acid molecule 

which is at least 45% (e.g., 55%, 65%, 75%, 85%, 95%, or 
98%) identical to the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID N0:6, SEQ ID NO:10, the 
nucleotide sequence of the cDNA insert of the plasmid 
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deposited with ATCC as Accession Number (the "cDNA of 
ATCC 98807"), or a complement thereof. 

The invention features a nucleic acid molecule 
which includes a fragment of at least 100 (e.g., 250, 

5 325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, 900, or 989) nucleotides of the nucleotide sequence 
shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:6, SEQ ID 
NO: 10, the nucleotide sequence of the cDNA ATCC 98807, or 
a complement thereof. 

o The invention also features a nucleic acid 

molecule which includes a nucleotide sequence encoding a 
protein having an amino acid sequence that is at least 
45% (55%, 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO: 5, SEQ ID 

5 NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, or the 
amino acid sequence encoded by the cDNA of ATCC 98807. 

In a preferred embodiment, a Tango- 77 nucleic acid 
molecule has the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10 or the 

o nucleotide sequence of the cDNA of ATCC 98807. 

Also within the invention is a nucleic acid 
molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2, SEQ ID NO:4, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 

5 NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, wherein the fragment 
includes at least 15 (e.g., 25, 30, 50, 100, 150, or 178) 
contiguous amino acids of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO:ll, SEQ ID NO:12, SEQ ID NO:13, or the polypeptide 
30 encoded by the cDNA of ATCC Accession Number 98807. 

The invention includes a nucleic acid molecule 
which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID N0:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 

35 SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO:12, SEQ ID NO:13, or 
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an amino acid sequence encoded by the cDNA of ATCC 
Accession Number 98807, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or a 
5 complement thereof under stringent conditions. 

Also within the invention are: an isolated 
Tango- 77 protein having an amino acid sequence that is at 
least about 45%, preferably 65%, 75%, 85%, 95%, or 98% 
identical to the amino acid sequence of SEQ ID NO: 5, SEQ 

10 ID NO: 9 or SEQ ID NO: 13 (mature human Tango-77) , or the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 7 or SEQ ID 
NO: 11 (immature human Tango-77), 

Also within the invention are: an isolated 
Tango-77 protein which is encoded by a nucleic acid 

is molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ 
ID NO:3, SEQ ID NO: 6, SEQ ID NO: 10 or the cDNA of ATCC 
98807; and an isolated Tango-77 protein which is encoded 
by a nucleic acid molecule having a nucleotide sequence 

20 which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO: 3, SEQ ID NO : 6 , SEQ ID NO: 10, the non-coding 
strand of the cDNA of ATCC 98807, or the complement 
thereof . 

25 Also within the invention is a polypeptide which 

is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO: 4, SEQ ID NO : 5 , SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an 

30 amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
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NO: 10 or the complement thereof under stringent 
conditions . 

Another embodiment of the invention features 
Tango- 77 nucleic acid molecules which specifically detect 

5 Tango-77 nucleic acid molecules relative to nucleic acid 
molecules encoding other members of the cytokine 
superfamily. For example, in one embodiment, a Tango-77 
nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 

10 nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3,. SEQ ID 
NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or a 
complement thereof. In another embodiment, the Tango-77 
nucleic acid molecule is at least 300 (325, 350, 375, 
400, 425, 450, 500, 550, 600, 650, 700, 800, 900, or 989) 

is nucleotides in length and hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or a 
complement thereof. In yet another embodiment, the 

20 invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a Tango-77 
nucleic acid. 

Another aspect of the invention provides a vector, 
e.g., a recombinant expression vector, comprising a 

25 Tango-77 nucleic acid molecule of the invention. In 
another embodiment, the invention provides a host cell 
containing such a vector. The invention also provides a 
method for producing Tango-77 protein by culturing, in a 
suitable medium, a host cell of the invention containing 

30 a recombinant expression vector such that a Tango-77 
protein is produced. 

Another aspect of this invention features isolated 
or recombinant Tango-77 proteins and polypeptides. 
Preferred Tango-77 proteins and polypeptides possess at 

35 least one biological activity possessed by naturally 
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occurring human Tango-77, e.g., (i) the ability to 
interact with proteins in the Tango-77 signalling pathway 
(ii) the ability to interact with a Tango-77 ligand or 
receptor; or (iii) the ability to interact with an 
5 intracellular target protein, (iv) the ability to 

interact with a protein involved in inflammation and (v) 
the ability to bind the IL-1 receptor. Other activities 
include the induction and suppression of polypeptide 
interleukins, cytokines and growth factors. 

10 The Tango-77 proteins of the present invention, or 

biologically active portions thereof, can be operably 
linked to a non-Tango-77 polypeptide (e.g., heterologous 
amino acid sequences) to form Tango-77 fusion proteins. 
The invention further features antibodies that 

is specifically bind Tango-77 proteins, such as monoclonal 
or polyclonal antibodies. In addition, the Tango-77 
proteins or biologically active portions thereof can be 
incorporated into pharmaceutical compositions, which 
optionally include pharmaceutical ly acceptable carriers. 

20 In another aspect, the present invention provides 

a method for detecting the presence of Tango-77 activity 
or expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an 
indicator of Tango-77 activity or expression such that 

25 the presence of Tango-77 activity or expression is 
detected in the biological sample. 

In another aspect, the invention provides a method 
for modulating Tango-77 activity comprising contacting a 
cell with an agent that modulates (inhibits or 

30 stimulates) 

Tango-77 activity or expression such that Tango-77 
activity or expression in the cell is modulated. In one 
embodiment, the agent is an antibody that specifically 
binds to Tango-77 protein. In another embodiment, the 
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agent modulates expression of Tango- 77 by modulating 
transcription of a Tango-77 gene, splicing of a Tango-77 
mRNA, or translation of a Tango-77 mRNA. In yet another 
embodiment, the agent is a nucleic acid molecule having a 

s nucleotide sequence that is antisense to the coding 
strand of the Tango-77 mRNA or the Tango-77 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 
characterized by aberrant Tango-77 protein activity or 

10 nucleic acid expression by administering an agent which 
is a Tango-77 modulator to the subject. In one 
embodiment, the Tango-77 modulator is a Tango-77 protein. 
In another embodiment, the Tango-77 modulator is a 
Tango-77 nucleic acid molecule. In other embodiments, 

is the Tango-77 modulator is a peptide, peptidomimetic, or 
other small molecule. In a preferred embodiment, the 
disorder characterized by aberrant Tango-77 protein or 
nucleic acid expression can include chronic and acute 
inflammation. 

20 The present invention also provides a diagnostic 

assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a Tango-77 protein; (ii) mis -regulation of a 

25 gene encoding a Tango-77 protein; and (iii) aberrant 
post-translational modification of a Tango-77 protein, 
wherein a wild-type form of the gene encodes a protein 
with a Tango-77 activity. 

In another aspect, the invention provides a 

30 method for identifying a compound that binds to or 
modulates the activity of a Tango-77 protein. In 
general, such methods entail measuring a biological 
activity of a Tango-77 protein in the presence and 
absence of a test compound and identifying those 
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compounds which alter the activity of the Tango-77 
protein. 

The invention also features methods for 
identifying a compound which modulates the expression of 
5 Tango-77 by measuring the expression of Tango-77 in the 
presence and absence of a compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims. 

10 Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 
of Tango-77. The Tango-77 cDNA has three possible open 
reading frames which encode the amino acid sequence (SEQ 
ID NO: 2, SEQ ID NO: 7 and SEQ ID NO: 11) of human Tango-77. 

is The three potential open reading frames of SEQ ID NO:l 
extend from: (1) nucleotide 356 to nucleotide 889 (SEQ ID 
NO:3); (2) nucleotide 389 to nucleotide 889 (SEQ ID 
NO:6); and (3) nucleotide 481 to nucleotide 889 (SEQ ID 
NO:10) . 

20 Figure 2 depicts an alignment of an amino acid 

sequence of Tango-77 (T77; SEQ ID NO: 2) with IL-1RA (SEQ 
ID NO:14), and IL-10 (SEQ ID NO:15). 

Figure 3 depicts the genomic sequence of BAC1 (SEQ 
ID NO: 16) . 

25 Figure 4 depicts the genomic sequence of BAG 2 (SEQ 

ID NO: 17) . 

Figure 5 depicts an amino acid sequence of an 
alternatively spliced form of Tango-77 (SEQ ID NO: 2) as 
predicted by Procrustes (T77-procrustes; SEQ ID NO:18). 
30 Figure 6 depicts an alignment of an amino acid 

sequence of an alternatively spliced form of Tango-77 
(T77-procrustes; SEQ ID NO: 18) with Tango-77 (SEQ ID 
NO:2) . 
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Figure 7 depicts an alignment of an amino acid 
sequence of an alternatively spliced form of Tango-77 
(T77-procrustes; SEQ ID NO: 18) with IL-lra (SEQ ID 
NO: 14), and IL-1/3 (SEQ ID NO:15). 

5 Detailed Description of the Invention 

The present invention is based on the discovery of 
a cDNA molecule encoding human Tango-77, a member of the 
cytokine superfamily. The cDNA molecule encoding human 
Tango-77 has three possible open reading frames. The 

10 three possible nucleotide open reading frames for human 
Tango-77 protein are shown in Figure 1 (SEQ ID NO: 3, SEQ 
ID NO: 6 and SEQ ID NO: 10) . The predicted amino acid 
sequence for the three possible Tango-77 immature 
proteins are also shown in 

15 Figure 1 (SEQ ID NO:2, SEQ ID NO:7 or SEQ ID NO:ll) and 
three possible mature proteins are also shown in Figure 1 
(SEQ ID NO: 5/ SEQ ID NO: 9 and SEQ ID NO: 13) . 

The Tango-77 cDNA of Figure 1 (SEQ ID N0:1), which 
is approximately 989 nucleotides long including 

20 untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 19 kDa, 18 kDa, or 
14.9 KDa (excluding post- translational modifications) and 
the possible mature form of the protein has a molecular 
weight of 13 kDa. A plasmid containing a cDNA encoding 

25 human Tango-77 (with the cDNA insert name of Of fthx077) 
was deposited with American Type Culture Collection 
(ATCC) , 10801 University Boulevard, Manassas, Virginia 
20110-2209 on July 2, 1998 and assigned Accession Number 
98807, This deposit will be maintained under the terms 

30 of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made merely as a 
convenience for those of skill in the art and is not an 
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admission that a deposit is required under 3 5 U.S.C. 
§112 . 

Human Tango-77 is one member of a family of 
molecules (the "Tango-77 family") having certain 
5 conserved structural and functional features. The term 
"family, " when referring to the protein and nucleic acid 
molecules of the invention, is intended to mean two or 
more proteins or nucleic acid molecules having a common 
structural domain and having sufficient amino acid or 

10 nucleotide sequence identity as defined herein. Such 

family members can be naturally occurring and can be from 
either the same or different species. For example, a 
family can contain a first protein of human origin and a 
homologue of that protein of murine origin, as well as a 

is second, distinct protein of human. origin and a murine 
homologue of that protein. Members of a family may also 
have common functional characteristics. 

As used interchangeably herein a "Tango-77 
activity", "biological activity of Tango-77" or 

20 "functional activity of Tango-77", refers to an activity 
exerted by a Tango-77 protein, polypeptide or nucleic 
acid molecule on a Tango-77 responsive cell as determined 
in vivo, or in vitro, according to standard techniques. 
A Tango-77 activity can be a direct activity, such as an 

25 association with a second protein, or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the Tango-77 protein with a second 
protein. In a preferred embodiment, a Tango-77 activity 
includes at least one or more of the following 

30 activities: (i) the ability to interact with proteins in 
the Tango-77 signalling pathway (ii) the ability to 
interact with a Tango-77 ligand or receptor; or (iii) the 
ability to interact with an intracellular target protein, 
(iv) the ability to interact with a protein involved in 
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inflammation, and (v) the ability to bind the IL-1 
receptor. 

Accordingly, another embodiment of the invention 
features isolated Tango-77 proteins and polypeptides 
5 having a Tango-77 activity. 

Yet another embodiment of the invention features 
Tango-77 molecules which contain a signal sequence. 
Generally, a signal sequence (or signal peptide) is a 
peptide containing about 21 to 63 amino acids which 

10 occurs at the extreme N-terminal end of a secretory 
protein. The native Tango-77 signal sequence (SEQ ID 
N0:4, SEQ ID NO: 8, or SEQ ID NO: 12) can be removed and 
replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), 

is expression and/or secretion of Tango-77 can be increased 
through use of a heterologous signal sequence. For 
example, the gp67 secretory sequence of the baculovirus 
envelope protein can be used as a heterologous signal 
sequence. Alternatively, the native Tango-77 signal 

20 sequence can itself be used as a heterologous signal 
sequence in expression systems, e.g., to facilitate the 
secretion of a protein of interest. 

Various aspects of the invention are described in 
further detail in the following subsections. 

25 I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode Tango-77 proteins or 
biologically active portions thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes 

30 to identify Tango- 77 -encoding nucleic acids (e.g., 

Tango-77 mRNA) and fragments for use as PCR primers for 
the amplification or mutation of Tango-77 nucleic acid 
molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
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cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 
stranded or double -stranded, but preferably is double- 
5 stranded DNA. 

An "isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 

10 sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 

is isolated Tango-77 nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb 
of nucleotide sequences which naturally flank the nucleic 
acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" nucleic 

20 acid molecule, such as a cDNA molecule, can be 

substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

25 A nucleic acid molecule of the present invention, 

e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10, the cDNA of ATCC 98807, or a complement of any of 
these nucleotide sequences, can be isolated using 

30 standard molecular biology techniques and the sequence 
information provided herein. Using all or a portion of 
the nucleic acid sequences of SEQ ID N0:1, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or the 
complement thereof as a hybridization probe, Tango-77 

35 nucleic acid molecules can be isolated using standard 
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hybridization and cloning techniques (e.g., as described 
in Sambrook et al - , eds ♦ , Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 

5 Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so 

io amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to Tango-77 nucleotide 
sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 

is In another preferred embodiment, an isolated 

nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO:6, SEQ ID NO: 10 the cDNA of ATCC 98807, or a 

20 portion thereof. A nucleic acid molecule which is 

complementary to a given nucleotide sequence is one which 
is sufficiently complementary to the given nucleotide 
sequence that it can hybridize to the given nucleotide 
sequence thereby forming a stable duplex. 

25 Moreover, the nucleic acid molecule of the 

invention can comprise only a portion of a nucleic acid 
sequence encoding Tango-77, for example, a fragment which 
can be used as a probe or primer or a fragment encoding a 
biologically active portion of Tango-77. The nucleotide 

30 sequence determined from the cloning of the human 

Tango-77 gene allows for the generation of probes and 
primers designed for use in identifying and/or cloning 
Tango-77 homologues in other cell types, e.g., from other 
tissues, as well as Tango-77 homologues from other 

35 mammals. The probe/primer typically comprises 
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substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 
conditions to at least about 12, preferably about 25, 
5 more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or ant i- sense sequence of SEQ ID N0:1, SEQ ID N0:3, SEQ 
ID NO: 6, SEQ ID NO: 10, or the cDNA of ATCC 98807. 
Alternatively, the oligonucleotide can typically comprise 

10 a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 
about 25, more preferably about 50, 75, 100, 125, 150, 
175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
the sense or anti-sense sequence of a naturally occurring 

15 mutant of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10, or the cDNA of ATCC 98807. 

Probes based on the human Tango- 77 nucleotide 
sequence can be used to detect transcripts or genomic 
sequences encoding the same or identical proteins. The 

20 probe comprises a label group attached thereto, e.g. , a 
radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissues 
which mis -express a Tango- 77 protein, such as by 

25 measuring a level of a Tango -77 -encoding nucleic acid in 
a sample of cells from a subject, e.g., detecting 
Tango- 7 7 mRNA levels or determining whether a genomic 
Tango- 77 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically 

30 active portion of Tango- 77" can be prepared by isolating 
a portion of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:6, SEQ 
ID NO: 10 or the nucleotide sequence of the cDNA of ATCC 
98807 which encodes a polypeptide having a Tango-77 
biological activity, expressing the encoded portion of 

35 Tango-77 protein (e.g., by recombinant expression in 
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vitro) and assessing the activity of the encoded portion 
of Tango-77. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 
5 ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or the 
cDNA of ATCC 98807 due to degeneracy of the genetic code 
and thus encode the same Tango-77 protein as that encoded 
by the nucleotide sequence shown in SEQ ID NO:l, SEQ ID 
NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or the cDNA of ATCC 
10 98307. 

In addition to the human Tango-77 nucleotide 
sequence shown in SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, 
SEQ ID NO: 10, or the cDNA of ATCC 98807, it will be 
appreciated by those skilled in the art that DNA sequence 

is polymorphisms that lead to changes in the amino acid 
sequences of Tango-77 may exist within a population 
(e.g., the human population). Such genetic polymorphism 
in the Tango-77 gene may exist among individuals within a 
population due to natural allelic variation. An allele 

20 is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the term "allelic 
variant" refers to a nucleotide sequence which occurs at 
a Tango-77 locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 

25 and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a Tango-77 
protein, preferably a mammalian Tango-77 protein. Such 
natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of the Tango-77 gene. 

30 Alternative alleles can be identified by sequencing the 
gene of interest in a number of different individuals. 
This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of 
individuals. Any and all such nucleotide variations and 

35 resulting amino acid polymorphisms or variations in 
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Tango- 77 that are the result of natural allelic variation 
and that do not alter the functional activity of Tango- 77 
are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding Tango-77 
5 proteins from other species (Tango-77 homologues) , which 
have a nucleotide sequence which differs from that of a 
human Tango-77, are intended to be within the scope of 
the invention. Nucleic acid molecules corresponding to 
natural allelic variants and homologues of the Tango-77 

10 cDNA of the invention can be isolated based on their 
identity to the human Tango-77 nucleic acids disclosed 
herein using the human cDNAs, or a portion thereof, as a 
hybridization probe according to standard hybridization 
techniques under stringent hybridization conditions . 

is Accordingly, in another embodiment, an isolated 

nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, or 989) nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule 

20 comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:l, SEQ ID N0:3, SEQ ID NO: 6, SEQ 
ID NO: 10, or the cDNA of ATCC 98807, 

As used herein, the term "hybridizes under 
stringent conditions" is intended to describe conditions 

25 for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 

30 in Molecular Biology, John Wiley Sc Sons, N.Y. (1989), 
6,3.1-6.3.6. A preferred, non-limiting example of 
stringent hybridization conditions are hybridization in 
6X sodium chloride /sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 

35 50-65°C. Preferably, an isolated nucleic acid molecule 
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of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807 r or the 
complement thereof, corresponds to a naturally-occurring 
s nucleic acid molecule. As used herein, a "naturally- 
occurring" nucleic acid molecule refers to an RNA or DNA 
molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic 
10 variants of the Tango- 77 sequence that may exist in the 
population, the skilled artisan will further appreciate 
that changes can be introduced by mutation into the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10 or the cDNA of ATCC 98807, thereby 
is leading to changes in the amino acid sequence of the 

encoded Tango-77 protein, without altering the biological 
activity of the Tango-77 protein. Amino acid residues 
that are not conserved or only semiconserved among Tango- 
77 of various species may be non-essential for activity 
20 and thus would likely be targets for alteration. 

Alternatively, one can make nucleotide substi tut ions 
leading to amino acid substitutions at "non-essential" 
amino acid residues. A "non-essential" amino acid 
residue is a residue that can be altered from the wild- 
25 type sequence of Tango-77 (e.g., the sequence of SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 11 
or SEQ ID NO: 13) without altering the biological 
activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino 
30 acid residues that are conserved among the Tango-77 

proteins of various species may be essential for activity 
and thus would not likely be targets for alteration, 
unless one wishes to reduce or alter Tango-77 activity. 
Accordingly, another aspect of the invention 
35 pertains to nucleic acid molecules encoding Tango-77 
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proteins that contain changes in amino acid residues that 
are not essential for activity. Such Tango-77 proteins 
differ in amino acid sequence from SEQ ID NO: 2, SEQ ID 
NO: 5, SEQ ID NO : 7 , SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID 
5 NO: 13 yet retain biological activity. In one embodiment, 
the isolated nucleic acid molecule includes a nucleotide 
sequence encoding a protein that includes an amino acid 
sequence that is at least about 45% identical, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of 

10 SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ 
ID NO: 11, or SEQ ID NO: 13. 

An isolated nucleic acid molecule encoding a 
Tango-77 protein having a sequence which differs. from 
that of SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 

15 NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 can be created by 
introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or 
the cDNA of ATCC 98807 such that one or more amino acid 

20 substitutions, additions or deletions are introduced into 
the encoded protein. Mutations can be introduced by 
standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more 

25 predicted non-essential amino acid residues. A 

"conservative amino acid substitution" is one in which 
the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino 
acid residues having similar side chains have been 

30 defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, 
histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, 

35 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
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valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine) * 
5 Thus, a predicted nonessential amino acid residue in 
Tango-77 is preferably replaced with another amino acid 
residue from the same side chain family. Alternatively, 
mutations can be introduced randomly along all or part of 
a Tango- 77 coding sequence, such as by saturation 
10 mutagenesis, and the resultant mutants can be screened 
for Tango-77 biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded 
protein can be expressed recombinantly and the activity 
of the protein can be determined. 
15 In a preferred embodiment, a mutant Tango-77 

protein can be assayed for: (1) the ability to form 
protein: protein interactions with proteins in the 
Tango-77 signalling pathway; (2) the ability to bind a 
Tango-77 ligand or receptor; or (3) the ability to bind 
20 to an intracellular target protein or (4) the ability to 
interact with a protein involved in inflammation or (5) 
the ability to bind the IL-1 receptor. In yet another 
preferred embodiment, a mutant Tango-77 can be assayed 
for the ability to modulate inflammation, asthma, 
25 autoimmune dieseases, and sepsis. 

The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 
complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a double- 
30 stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
nucleic acid can be complementary to an entire Tango-77 
coding strand, or to only a portion thereof, e.g., all or 
35 part of the protein coding region (or open reading 
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frame) . An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 
nucleotide sequence encoding Tango-77. The noncoding 
regions ("5' and 3' untranslated regions") are the 5' and 
5 3' sequences which flank the coding region and are not 
translated into amino acids. 

Given the coding strand sequences encoding 
Tango-77 disclosed herein (e.g.,SEQ ID NO:3, SEQ ID NO:5, 
or SEQ ID NO:8), antisense nucleic acids of the invention 
10 can be designed according to the rules of Watson and 

Crick base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of 
Tango-77 mRNA, but more preferably is an oligonucleotide 
which is antisense to only a portion of the coding or 
is noncoding region of Tango-77 mRNA. For example, the 
antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of Tango-77 
mRNA, e.g., an oligonucleotide having the sequence 
5 ' - TGCAACTTTTACAGGAAACAC - 3' (SEQ ID NO:19) or 
20 5 ' - CCTCACTTTTACCCGAGACTC - 3 ' ( SEQ ID NO : 2 0 ) or 

5' -GACGGGTGGTACTTAAAACAA-3' (SEQ ID N0:21). An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 
25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be 
25 constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
30 modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
35 Examples of modified nucleotides which can be used to 
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generate the antisense nucleic acid include 5- 
f luorouracil , 5-bromouracil , 5-chlorouracil , 
5-iodouracil , hypoxanthine, xanthine, 4-acetylcytosine, 
5- (carboxyhydroxylmethyl) uracil, 5- 

5 carboxymethyl aminomethyl -2 - thiouridine , 

5-carboxymethylaminomethyluracil , dihydrouracil , 
beta-D-galactosylqueosine, inosine, N6- 
isopentenyladenine , 1-methylguanine , 1 -methylinosine , 
2 , 2 -dimethylguanine , 2 -methyladenine , 2 -methyl guanine , 3 - 

10 methylcytosine, 5 -methylcytosine, N6-adenine, 7- 
methylguanine , 5 -methylaminomethyluracil , 5 - 
me t hoxy ami nome t hy 1 - 2 - 1 h i our a c i 1 , beta-D- mannosy 1 que o s i ne , 
5' -methoxycarboxymethyluracil , 5-methoxyuracil, 2- 
methylthio-N6-isopentenyladenine, uracil- 5 -oxyacetic acid 

is (v) , wybutoxosine, pseudouracil, queosine, 2- 

thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil , 4- 
thiouracil, 5-methyluracil , uracil -5 -oxyacetic acid 
methylester, uracil -5 -oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil 

20 (acp3)w, and 2 , 6-diaminopurine . Alternatively, the 

antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 

25 antisense orientation to a target nucleic acid of 

interest, described further in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 

30 to cellular mRNA and/ or genomic DNA encoding a Tango- 77 
protein to thereby inhibit expression of the protein, 
e.g., by inhibiting transcription and/or translation. 
The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, 
35 in the case of an antisense nucleic acid molecule which 
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binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a 
route of administration of antisense nucleic acid 
molecules of the invention includes direct injection at a 
5 tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 

10 antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 

is To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

20 An antisense nucleic acid molecule of the 

invention can be an a-anomeric nucleic acid molecule. An 
a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual j3-units, the strands run parallel 

25 to each other (Gaultier et al. (1987) Nucleic Acids Res. 
15:6625-6641) . The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA -DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327- 

30 330) . 

The invention also encompasses ribozymes. 
Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a 
35 complementary region. Thus, ribozymes (e.g., hammerhead 
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ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave 
Tango- 77 mRNA transcripts to thereby inhibit translation 
of Tango- 77 mRNA. A ribozyme having specificity for a 
Tango-77 -encoding nucleic acid can be designed based upon 
the nucleotide sequence of a Tango-77 cDNA disclosed 
herein (e.g., SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:6, SEQ 
ID NO: 10) . For example, a derivative of a Tetrahymena L- 
19 IVS RNA can be constructed in which the nucleotide 
sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a Tango- 77 -encoding 
mRNA. See, e.g., Cech et al . U.S. Patent No. 4,987,071; 
and Cech et al . U.S. Patent No. 5,116,742. 
Alternatively, Tango-77 mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel and 
Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
o example, Tango-77 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the 
regulatory region of the Tango-77 (e.g., the Tango-77 
promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the Tango-77 
5 gene in target cells. See generally, Helene (1991) 

Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y. 
Acad. Sci. 660:27-36; and Maher (1992) Bioassays 
14 (12) :807-15. 

In preferred embodiments, the nucleic acid 
o molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate 
5 peptide nucleic acids (see Hyrup et al. (1996) Bioorganic 


WO 99/06426 


PCT/US98/16102 


- 25 - 

& Medicinal Chemistry 4(1) : 5-23) . As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopept ide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry-O' Keef e et al . (1996) Proc. Natl. 
Acad. Sci. USA 93: 14670-675. 

PNAs of Tango- 77 can be used in therapeutic and 
diagnostic applications. For example, PNAs can be used 
as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing 
transcription or translation arrest or inhibiting 
replication. PNAs of Tango-77 can also be used, e.g., in 
the analysis of single base pair mutations in a gene by, 
e.g., PNA directed PGR clamping; as artificial 
restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as 
probes or primers for DNA sequence and hybridization 
(Hyrup (1996) supra; Perry-O' Keef e et al . (1996) Proc. 
Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of Tango-77 can be 
modified, e.g., to enhance their stability or cellular 
uptake, by attaching lipophilic or other helper groups to 
PNA, by the formation of PNA -DNA chimeras, or by the use 
of liposomes or other techniques of drug delivery known 
in the art. For example, PNA- DNA chimeras of Tango-77 
can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, 
to interact with the DNA portion while the PNA portion 
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would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, 
number of bonds between the nucleobases, and orientation 
5 (Hyrup (1996) supra) . The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) supra and 
Finn et al . (1996) Nucleic Acids Res. 24 (17) : 3357-63 . 
For example, a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling chemistry 

10 and modified nucleoside analogs. Compounds such as 5'- 
(4-methoxytrityl) amino-5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 
DNA (Mag et al . (1989) Nucleic Acid Res. 17:5973-88). 
PNA monomers are then coupled in a stepwise manner to 

is produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al. (1996) Nucleic Acids Res. 
24 (17) :3357-63) . Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al. (1975) Bioorganic Med. Chem. Lett. 

20 5 : 1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 

25 e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 

30 modified with hybridization-triggered cleavage agents 
(see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
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hybridization triggered cross -linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

II. Isolated Tango-77 Proteins and Anti-Tanoo-77 
Antibodies 

One aspect of the invention pertains to isolated 
Tango-77 proteins, and biologically active portions 
thereof, as well as polypeptide fragments suitable for 
use as immunogens to raise anti-Tango-77 antibodies. In 
one embodiment, native Tango-77 proteins can be isolated 
from cells or tissue sources by an appropriate 
purification scheme using standard protein purification 
techniques. In another embodiment, Tango-77 proteins are 
produced by recombinant DNA techniques. Alternative to 
recombinant expression, a Tango-77 protein or polypeptide 
can be synthesized chemically using standard peptide 
synthesis techniques. 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
the cell or tissue source from which the Tango-77 protein 
is derived, or substantially free of chemical precursors 
or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" 
includes preparations of Tango-77 protein in which the 
protein is separated from cellular components of the 
cells from which it is isolated or recombinantly 
produced. Thus, Tango-77 protein that is substantially 
free of cellular material includes preparations of 
Tango-77 protein having less than about 30%, 20%, 10%, or 
5% (by dry weight) of non-Tango-77 protein (also referred 
to herein as a "contaminating protein") . When the 
Tango-77 protein or biologically active portion thereof 
is recombinantly produced, it is also preferably 
substantially free of culture medium, i.e., culture 


WO 99/06426 


PCT/US98/I6102 


- 28 - 

medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When Tango-77 protein 
is produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 

5 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the protein. Accordingly such preparations of Tango-77 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or non-Tango-77 chemicals. 

io Biologically active portions of a Tango-77 protein 

include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the Tango-77 protein (e.g., the amino acid 
sequence shown in SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, 

15 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ- ID NO: 13), which 

include fewer amino acids than the full length Tango-77 
proteins, and exhibit at least one activity of a Tango-77 
protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 

20 the Tango-77 protein. A biologically active portion of a 
Tango-77 protein can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 

25 prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native Tango-77 
protein. 

Preferred Tango-77 protein has the amino acid 
sequence shown of SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO: 7, 

30 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13. Other useful 
Tango-77 proteins are substantially identical to SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO: 11, or SEQ ID NO: 13 and retain the functional activity 
of the protein of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, 

35 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 yet differ in 
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amino acid sequence due to natural allelic variation or 
mutagenesis. Accordingly, a useful Tango-77 protein is a 
protein which includes an amino acid sequence at least 
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99% 
s identical to the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, or SEQ 
ID NO: 13 and retains the functional activity of the 
Tango-77 proteins of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13. In a 
10 preferred embodiment, the Tango-77 protein retains a 
functional activity of the Tango-77 protein of SEQ ID 
NO:2, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO : 9 , SEQ ID 
NO:ll, or SEQ ID NO:13. 

To determine the percent identity of two amino 
is acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
20 or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
25 that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity » # 
of identical posit ions/ total # of positions, e.g., 
overlapping x 100) . Preferably, the two sequences are 
30 the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non- limiting example of a 
mathematical algorithm utilized for the comparison of two 
35 sequences is the algorithm of Karlin arid Altschul (1990) 
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Proc. Natl, Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) 
5 J. Mol. Biol. 215:403-410. BLAST nucleotide searches can 
be performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to Tango- 77 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST 

10 program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to Tango- 77 protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes , Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 

is When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. Another preferred, non- 
limiting example of a mathematical algorithm utilized for 

20 the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
the ALIGN program for comparing amino acid sequences, a 

25 PAM120 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 

30 percent identity, only exact matches are counted. 

The invention also provides Tango- 77 chimeric or 
fusion proteins. As used herein, a Tango-77 "chimeric 
protein" or "fusion protein" comprises a Tango-77 
polypeptide operably linked to a non-Tango-77 

35 polypeptide. A "Tango-77 polypeptide" refers to a 
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polypeptide having an amino acid sequence corresponding 
to Tango-77 polypeptides, whereas a "non- Tango- 77 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not 
s substantially identical to the Tango-77 protein, e.g., a 
protein which is different from the Tango-77 protein and 
which is derived from the same or a different organism. 
Within a Tango-77 fusion protein the Tango-77 polypeptide 
can correspond to all or a portion of a Tango-77 protein, 

10 preferably at least one biologically active portion of a 
Tango-77 protein. Within the fusion protein, the term 
"operably linked" is intended to indicate that the 
Tango-77 polypeptide and the non-Tango-77 polypeptide are 
fused in- frame to each other. The non-Tango-77 

15 polypeptide can be fused to the N-terminus or C-terminus 
of the Tango-77 polypeptide. 

One useful fusion protein is a GST-Tango-77 fusion 
protein in which the Tango-77 sequences are fused to the 
C-terminus of the GST sequences. Such fusion proteins 

20 can facilitate the purification of recombinant Tango-77. 

In another embodiment, the fusion protein is a 
Tango-77 protein containing a heterologous signal 
sequence at its N-terminus. For example, the native 
Tango-77 signal sequence (i.e., about amino acids 1 to 63 

25 of SEQ ID NO: 2; SEQ ID NO: 4; or about amino acids 1 to 52 
of SEQ ID NO: 7, -SEQ ID NO: 8; or about amino acids 1 to 21 
of SEQ ID NO: 11; SEQ ID NO: 12) can be removed and replaced 
with a signal sequence from another protein. In certain 
host cells (e.g., mammalian host cells), expression 

30 and/or secretion of Tango-77 can be increased through use 
of a heterologous signal sequence. For example, the gp67 
secretory sequence of the baculovirus envelope protein 
can be used as a heterologous signal sequence (AusuJbel et 
al., supra). Other examples of eukaryotic heterologous 

35 signal sequences include the secretory sequences of 
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melittin and human placental alkaline phosphatase 
(Stratagene; La Jolla, California) . In yet another 
example, useful prokaryotic heterologous signal sequences 
include the phoA secretory signal (Sambrook et al., 
5 supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 

In yet another embodiment, the fusion protein is 
an Tango- 77 -immunoglobulin fusion protein in which all or 
part of Tango- 77 is fused to sequences derived from a 

10 member of the immunoglobulin protein family. The 

Tango-77-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and 
administered to a subject to inhibit an interaction 
between a Tango- 77 ligand and a Tango-77 receptor on the 

is surface of a cell, to thereby suppress Tango-77-mediated 
signal transduction in vivo. The Tango-77-immunoglobulin 
fusion proteins can be used to affect the bioavailability 
of a Tango-77 cognate ligand. Inhibition of the Tango-77 
ligand/Tango-77 interaction may be useful therapeutically 

20 for both the treatment of inflammatory and autoimmune 
disorders. Moreover, the Tango-77 -immunoglobulin fusion 
proteins of the invention can be used as immunogens to 
produce anti -Tango- 77 antibodies in a subject, to purify 
Tango-77 ligands and in screening assays to idencify 

25 molecules which inhibit the interaction of Tango-77 with 
a Tango-77 receptor. 

Preferably, a Tango-77 chimeric or fusion protein 
of the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the 

30 different polypeptide sequences are ligated together in- 
frame in accordance with conventional techniques, for 
example by employing blunt -ended or stagger -ended termini 
for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling- in of cohesive ends as 

35 appropriate, alkaline phosphatase treatment to avoid 
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undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene 
5 fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
annealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Current Protocols in Molecular 

10 Biology, Ausubel et al. eds., John Wiley & Sons: 1992). 
Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a 
GST polypeptide) . An Tango- 77 -encoding nucleic acid can 
be cloned into such an expression vector such that the 

is fusion moiety is linked in-frame to the Tango-77 protein. 

The present invention also pertains to variants of 
the Tango-77 proteins (i.e., proteins having a sequence 
which differs from that of the Tango-77 amino acid 
sequence) . Such variants can function as either Tango-77 

20 agonists (mimetics) or as Tango-77 antagonists. Variants 
of the Tango-77 protein can be generated by mutagenesis, 
e.g., discrete point mutation or truncation of the 
Tango-77 protein. An agonist of the Tango-77 protein can 
retain substantially the same, or a subset, of the 

25 biological activities of the naturally occurring form of 
the Tango-77 protein. An antagonist of the Tango-77 
protein can inhibit one or more of the activities of the 
naturally occurring form of the Tango-77 protein by, for 
example, competitively binding to a downstream or 

30 upstream member of a cellular signaling cascade which 

includes the Tango-77 protein. Thus, specific biological 
effects can be elicited by treatment with a variant of 
limited function. Treatment of a subject with a variant 
having a subset of the biological activities of the 

35 naturally occurring form of the protein can have fewer 
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side effects in a subject relative to treatment with the 
naturally occurring form of the Tango-77 proteins. 

Variants of the Tango-77 protein which function as 
either Tango-77 agonists (mimetics) or as Tango-77 
5 antagonists can be identified by screening combinatorial 
libraries of mutants, e.g., truncation mutants, of the 
Tango-77 protein for Tango-77 protein agonist or 
antagonist activity. In one embodiment, a variegated 
library of Tango-77 variants is generated by 

10 combinatorial mutagenesis at the nucleic acid level and 
is encoded by a variegated gene library. A variegated 
library of Tango-77 variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 

is degenerate set of potential Tango-77 sequences is 

expressible as individual polypeptides, or alternatively, 
as a set of larger fusion proteins (e.g., for phage 
display) containing the set of Tango-77 sequences 
therein. There are a variety of methods which can be 

20 used to produce libraries of potential Tango-77 variants 
from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed 
in an automatic DNA synthesizer, and the synthetic gene 
then ligated into an appropriate expression vector. Use 

25 of a degenerate set of genes allows for the provision, in 
one mixture, of all of the sequences encoding the desired 
set of potential Tango-77 sequences. Methods for 
synthesizing degenerate oligonucleotides are known in the 
art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura 

30 et al. (1984) Annu. -Rev. Biochem. 53:323; Itakura et al . 
(1984) Science 198:1056; Ike et al . (1983) Nucleic Acid 
Res. 11:477) . 

In addition, libraries of fragments of the 
Tango-77 protein coding sequence can be used to generate 

35 a variegated population of Tango-77 fragments for 
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screening and subsequent selection of variants of a 
Tango- 77 protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double 
stranded PCR fragment of a Tango- 77 coding sequence with 
5 a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded 
DNA, renaturing the DNA to form double stranded DNA which 
can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed 

10 duplexes by treatment with SI nuclease, and ligating the 
resulting fragment library into an expression vector. By 
this method, an expression library can be derived which 
encodes N- terminal and internal fragments of various 
sizes of the Tango-77 protein. 

is Several techniques are known in the art for 

screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 

20 gene libraries generated by the combinatorial mutagenesis 
of Tango-77 proteins. The most widely used techniques, 
which are amenable to high through-put analysis, for 
screening large gene libraries typically include cloning 
the gene library into replicable expression vectors, 

25 transforming appropriate cells with the resulting library 
of vectors, and expressing the combinatorial genes under 
conditions in which detection of a desired activity 
facilitates isolation of the vector encoding the gene 
whose product was detected. Recursive ensemble 

30 mutagenesis (REM) , a technique which enhances the 

frequency of functional mutants in the libraries, can be 
used in combination with the screening assays to identify 
Tango-77 variants (Arkin and Yourvan (1992) Proc. Natl. 
Acad. Sci. USA 89:7811-7815; Delgrave et al . (1993) 

35 Protein Engineering 6(3) : 327-331) . 
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An isolated Tango- 77 protein, or a portion or 
fragment thereof, can be used as an immunogen to generate 
antibodies that bind Tango- 77 using standard techniques 
for polyclonal and monoclonal antibody preparation. The 
5 full-length Tango-77 protein can be used or, 

alternatively, the invention provides antigenic peptide 
fragments of Tango-77 for use as immunogens. The 
antigenic peptide of Tango-77 comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the 

10 amino acid sequence shown in SEQ ID NO:2, SEQ ID NO: 5, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll or SEQ ID NO:13 
and encompasses an epitope of Tango-77 such that an 
antibody raised against the peptide forms a specific 
immune complex with Tango-77. 

is A Tango-77 immunogen typically is used to prepare 

antibodies by immunizing a suitable subject (e.g., 
rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 
example, recombinant ly expressed Tango-77 protein or a 

20 chemically synthesized Tango-77 polypeptide. The 
preparation can further include an adjuvant, such as 
Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic Tango-77 preparation induces 

25 a polyclonal anti-Tango-77 antibody response. 

Accordingly, another aspect of the invention 
pertains to anti-Tango-77 antibodies. The term 
"antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 

30 immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as Tango-77. A molecule which specifically binds to 
Tango-77 is a molecule which binds Tango-77, but does not 
substantially bind other molecules in a sample, e.g., a 

35 biological sample, which naturally contains Tango-77. 


WO 99/06426 


PCT/US98/16102 


- 37 - 

Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab') 2 
fragments which can be generated by treating the antibody 
with an enzyme such as pepsin. The invention provides 
5 polyclonal and monoclonal antibodies that bind Tango- 77. 
The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a 

10 particular epitope of Tango-77. A monoclonal antibody 
composition thus typically displays a single binding 
affinity for a particular Tango-77 protein with which it 
immunoreacts . 

Polyclonal anti -Tango- 77 antibodies can be 

is prepared as described above by immunizing a suitable 
subject with a Tango-77 immunogen. The anti -Tango- 7 7 
antibody titer in the immunized subject can be monitored 
over time by standard techniques , such as with an enzyme 
linked immunosorbent assay (EL ISA) using immobilized 

20 Tango-77. If desired, the antibody molecules directed 
against Tango-77 can be isolated from the mammal (e.g., 
from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain 
the IgG fraction. At an appropriate time after 

25 immunization, e.g. , when the anti -Tango- 77 antibody 
titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal 
antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein 

30 (1975) Nature .256 :495-497, the human B cell hybridoma 
technique (Kozbor et al . (1983) Immunol Today 4:72), the 
EBV-hybridoma technique (Cole et al. (1985), Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96) or trioma techniques. The technology for 

35 producing hybridomas is well known (see generally Current 
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Protocols in Immunology (1994) Coligan et al . (eds.) John 
Wiley & Sons, Inc., New York, NY). Briefly, an immortal 
cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a 
5 Tango-77 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are 
screened to identify a hybridoma producing a monoclonal 
antibody that binds Tango-77. 

Any of the many well known protocols used for 

10 fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti -Tango- 77 
monoclonal antibody (see, e.g., Current Protocols in 
Immunology, supra; Galfre et al . (1977) Nature 266:55052; 
R.H. Kenneth, in Monoclonal Antibodies: A New Dimension 

is In Biological Analyses, Plenum Publishing Corp., New 
York, New York (1980); and Lemer (1981) Yale J. Biol. 
Med., 54:387-402. Moreover, the ordinarily skilled 
worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the 

20 immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For 
example, murine hybridomas can be made by fusing 
lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an immortalized 

25 mouse cell line, e.g., a myeloma cell line that is 
sensitive to culture medium containing hypoxanthine, 
aminopterin and thymidine ("HAT medium") . Any of a 
number of myeloma cell lines can be used as a fusion 
partner according to standard techniques, e.g., the P3- 

30 NSl/l-Ag4-l, P3-x63-Ag8 . 653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, 
HAT-sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol ("PEG") . Hybridoma 
cells resulting from the fusion are then selected using 

35 HAT medium, which kills unfused and unproductively fused 
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myeloma cells (unfused splenocytes die after several days 
because they are not transformed) . Hybridoma cells 
producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supernatants 
5 for antibodies that bind Tango-77, e.g., using a standard 
EL ISA assay. 

Alternative to preparing monoclonal antibody- 
secreting hybridomas, a monoclonal anti-Tango-77 antibody 
can be identified and isolated by screening a recombinant 

10 combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with Tango-77 to thereby isolate 
immunoglobulin library members that bind Tango-77. Kits 
for generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 

15 Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene SurfZAP m Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 

20 U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 

25 92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 
246:1275-1281; Griffiths et al, (1993) EMBO J 12:725-734. 
Additionally, recombinant anti-Tango-77 

30 antibodies, such as chimeric and humanized monoclonal 

antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be 

35 produced by recombinant DNA techniques known in the art, 
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for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,496; European Patent 
Application 173,494; PCT Publication No. WO 86/01533; 
5 U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al . (1988) Science 240:1041-1043; Liu 
et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; 
Liu et al. (1987) J". Immunol. 139:3521-3526; Sun et al . 
(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura 

10 et al. (1987) Cane. Res. 47:999-1005; Wood et al . (1985) 
Mature 314:446-449; and Shaw et al . (1988) J. Natl. 
Cancer Inst. 80:1553-1559); Morrison (1985) Science 
229:1202-1207; Oi et al . (1986) Bio/Techniques 4:214; 
U.S. Patent 5,225,539; Jones et al . (1986) Mature 

is 321:552-525; Verhoeyan et al. (1988) Science 239:1534; 
and Beidler et al. (1988) J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly 
desirable for therapeutic treatment of human patients. 
Such antibodies can be produced using transgenic mice 

20 which are incapable of expressing endogenous 

immunoglobulin heavy and light chains genes, but which 
can express human heave and light chain genes. The 
transgenic mice are immunized in the normal fashion with 
a selected antigen, e.g., all or a portion of Tango-77. 

25 Monoclonal antibodies directed against the antigen can be 
obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 

30 mutation. Thus, using such a technique, it is possible 
to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. Rev. Immunol. 13:65-93). For a detailed discussion 

35 of this technology for producing human antibodies and 
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human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
5 companies such as Abgenix, Inc. (Freemont, CA> , can be 
engaged to provide human antibodies directed against a 
selected antigen using technology similar to the 
described above. 

Completely human antibodies which recognize a 

10 selected epitope can be generated using a technique 
referred to as "guided selection." In this approach a 
selected non-human monoclonal antibody, e.g., a murine 
antibody, is used to guide the selection of a completely 
human antibody recognizing the same epitope. 

is First, a non-human monoclonal antibody which binds 

a selected antigen (epitope), e.g., an antibody which 
inhibits Tango-77 activity, is identified. The heave 
chain and the light chain of the non-human antibody are 
cloned and used to create phage display Fab fragments. 

20 For example, the heave chain gene can be cloned into a 
plasmid vector so that the heavy chain can be secreted 
from bacteria. The light chain gene can be cloned into a 
phage coat protein gene so that the light chain can be 
expressed on the surface of phage. A repertoire (random 

2 5 collection) of human light chains fused to phage is used 
to infect the bacteria which express the non- human heavy 
chain. The resulting progeny phage display hybrid 
antibodies (human light chain/non-human heavy chain) . 
The selected antigen is used in a panning screen to 

30 select phage which bind the selected antigen. Several 
rounds of selection may be required to identify such 
phage. Next, human light chain genes are isolated from 
the selected phage which bind the selected antigen. 
These selected human light chain genes are then used to 

35 guide the selection of human heavy chain genes as 
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follows. The selected human light chain genes are 
inserted into vectors for expression by bacteria. 
Bacteria expressing the selected human light chains are 
infected with a repertoire of human heavy chains fused to 
5 phage. The resulting progeny phage display human 
antibodies (human light chain/human heavy chain) . 

Next, the selected antigen is used in a panning 
screen to select phage which bind the selected antigen. 
The phage selected in this step display completely human 

10 antibody which recognize the same epitope recognized by 
the original selected, non-human monoclonal antibody. 
The genes encoding both the heavy and light chains are 
readily isolated and be further manipulated for 
production of human antibody. This technology is 

is described by Jespers et al. (1994, Bio/ technology 12:899- 
903) . 

An anti-Tango-77 antibody (e.g., monoclonal 
antibody) can be used to isolate Tango- 77 by standard 
techniques, such as affinity chromatography or 

20 immunoprecipitation. An anti-Tango-77 antibody can 
facilitate the purification of natural Tango-77 from 
cells and of recombinantly produced Tango-77 expressed in 
host cells. Moreover, an anti-Tango-77 antibody can be 
used to detect Tango-77 protein (e.g., in a cellular 

25 lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the Tango-77 
protein. Anti-Tango-77 antibodies can be used 
diagnostically to monitor protein levels in tissue as 
part of a clinical testing procedure, e.g., to, for 

30 example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 

35 bioluminescent materials, and radioactive materials. 
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Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, /3-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
5 avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone , fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol; examples of 
10 bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 I, U1 I, 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to 

is vectors, preferably expression vectors, containing a 
nucleic acid molecule encoding Tango-77 (or a portion 
thereof) , As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of 

20 vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 

25 replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 
vectors (e.g i# non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 

30 introduction into the host cell, and thereby are 

replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 
expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
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DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
s adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 

The recombinant expression vectors of the 
invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 

10 host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 
expression, which is operably linked to the nucleic acid 
sequence to be expressed. Within a recombinant 

is expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 

20 when the vector is introduced into the host cell) . The 
term "regulatory sequence 11 is intended to include 
promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such 
regulatory sequences are described, for example, in 

25 Goeddel; Gene Expreaaion Technology: Methods in 

Enzymology 185, Academic Press, San Diego, CA (1990) . 
Regulatory sequences include those which direct 
constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 

30 the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 
of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 

35 of expression of protein desired, etc. The expression 
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vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., Tango-77 proteins, mutant forms 
5 of Tango-77, fusion proteins, etc.). 

The recombinant expression vectors of the 
invention can be designed for expression of Tango-77 in 
prokaryotic or eukaryotic cells, e.g., bacterial cells 
such as E. coli, insect cells (using baculovirus 

10 expression vectors), yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology' Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990) . Alternatively, the 
recombinant expression vector can be transcribed and 

15 translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 
often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 

20 expression of either fusion or non- fusion proteins. 

Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 

25 protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by aeting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 

30 of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 

35 Typical fusion expression vectors include pGEX (Pharmacia 
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Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST) , maltose E binding protein, or protein 
5 A, respectively, to the target recombinant protein. 

Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al . (1988) Gene 
69:301-315) and pET lid (Studier et al., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 

10 San Diego, California (1990) 60-89) . Target gene 
expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion 
promoter. Target gene expression from the pET lid vector 
relies on transcription from a T7 gnlO-lac fusion 

is promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21 (DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

20 One strategy to maximize recombinant protein 

expression in E. coli is to express the protein in a host 
bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 

25 Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 
the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al. 

30 (1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the Tango- 77 expression 
vector is a yeast expression vector. Examples of vectors 

35 for expression in yeast S. cerivisae include pYepSecl 
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(Baldari et al. (1987) EMBO J. 6:229-234), pMFa (Kurjan 
and Herskowitz (1982) Cell 30 : 933-943) , pJRY88 (Schultz 
et al. (1987) Gene 54:113-123), pYES2 (Invitrogen 
Corporation, San Diego, CA) , and picZ (InVitrogen Corp, 
5 San Diego, CA) . 

Alternatively, Tango-77 can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series 
10 (Smith et al . (1983) Mol. Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31- 
39) . 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 

is mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 

20 regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al, (supra) . 

25 In another embodiment, the recombinant mammalian 

expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
express the nucleic acid) . Tissue-specific regulatory 

30 elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al. (1987) Genes 
Dev. 1:268-277), lymphoid- specif ic promoters (Calame and 
Eaton (1988) Adv. Immunol. 43:235-275), in particular 

35 promoters of T cell receptors (Winoto and Baltimore 
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(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specif ic promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc. 
5 Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific 

promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmental ly- 

10 regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the a-f etoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant 

is expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 
operably linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 

20 molecule) of an RNA molecule which is antisense to 

Tango- 77 mRNA. Regulatory sequences operably linked to a 
nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 

25 instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 

30 which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
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Weintraub et al . (Reviews - Trends in Genetics, Vol. 1(1) 
1986) . 

Another aspect of the invention pertains to host 
cells into which a recombinant expression vector of the 

5 invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 

10 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein. 

15 A host cell can be any prokaryotic or eukaryotic 

cell. For example, Tango-77 protein can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells) . Other suitable host cells are known 
20 to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer 
25 to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co- 
precipitation, DEAE-dextran-mediated transfection, 
lipofection, or elect roporation. Suitable methods for 
30 transforming or transfecting host cells can be found in 
Sambrook, et al . (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 
known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
35 cells may integrate the foreign DNA into their genome. 
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In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 

5 markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate- Nucleic acid 
encoding a selectable marker can be introduced into a 
host cell on the same vector as that encoding Tango- 77 or 
can be introduced on a separate vector. Cells stably 

10 transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention/ such as a 

is prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) Tango-77 protein. 
Accordingly, the invention further provides methods for 
producing Tango-77 protein using the host cells of the 
invention. In one embodiment, the method comprises 

20 culturing the host cell of invention (into which a 

recombinant expression vector encoding Tango-77 has been 
introduced) in a suitable medium such that Tango-77 
protein is produced. In another embodiment, the method 
further comprises isolating Tango-77 from the medium or 

25 the host cell. 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 
one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which 

30 Tango- 77 -coding sequences have been introduced. Such 
host cells can then be used to create non-human 
transgenic animals in which exogenous Tango-77 sequences 
have been introduced into their genome or homologous 
recombinant animals in which endogenous Tango-77 

35 sequences have been altered. Such animals are useful for 
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studying the function and/or activity of Tango-77 and for 
identifying and/or evaluating modulators of Tango-77 
activity. As used herein, a "transgenic animal" is a 
non-human animal, preferably a mammal, more preferably a 
5 rodent such as a rat or mouse, in which one or more of 
the cells of the animal includes a transgene. Other 
examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, 
etc. A transgene is exogenous DNA which is integrated 

10 into the genome of a cell from which a transgenic animal 
develops and which remains in the genome of the mature 
animal, thereby directing the expression of an encoded 
gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous 

is recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous 
Tango-77 gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 

20 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal, 

A transgenic animal of the invention can be 
created by introducing Tango- 77 -encoding nucleic acid 
into the male pronuclei of a fertilized oocyte, e.g., by 

25 microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. The Tango-77 cDNA sequence e.g., that of (SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:6; SEQ ID NO:10 or the cDNA 
of ATCC 98807) can be introduced as a transgene into the 

30 genome of a non-human animal. Alternatively, a nonhuman 
homologue of the human Tango-77 gene, such as a mouse 
Tango-77 gene, can be isolated based on hybridization to 
the human Tango-77 cDNA and used as a transgene. 
Intronic sequences and polyadenylation signals can also 

35 be included in the transgene to increase the efficiency 
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of expression of the transgene. A tissue-specific 
regulatory sequence (s) can be operably linked to the 
Tango- 77 transgene to direct expression of Tango-77 
protein to particular cells. Methods for generating 
5 transgenic animals via embryo manipulation and 

microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 

10 Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the Tango-77 transgene in its genome and/or expression 

is of Tango-77 mRNA in tissues or cells of the animals. A 
transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding Tango-77 
can further be bred to other transgenic animals carrying 

20 other transgenes. 

To create an homologous recombinant animal, a 
vector is prepared which contains at least a portion of a 
Tango-77 gene (e.g., a human or a non-human homolog of 
the Tango-77 gene, e.g., a murine Tango-77 gene) into 

25 which a deletion, addition or substitution has been 

introduced to thereby alter, e.g., functionally disrupt, 
the Tango-77 gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the 
endogenous Tango-77 gene is functionally disrupted (i.e., 

30 no longer encodes a functional protein; also referred to 
as a "knock out" vector) . Alternatively, the vector 

can be designed such that, upon homologous recombination, 
the endogenous Tango-77 gene is mutated or otherwise 
altered but still encodes functional protein (e.g., the 

35 upstream regulatory region can be altered to thereby 
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alter the expression of the endogenous Tango- 77 protein) . 
In the homologous recombination vector, the altered 
portion of the Tango- 77 gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the Tango-77 gene to 
5 allow for homologous recombination to occur between the 
exogenous Tango-77 gene carried by the vector and an 
endogenous Tango-77 gene in an embryonic stem cell. The 
additional flanking Tango-77 nucleic acid is of 
sufficient length for successful homologous recombination 

10 with the endogenous gene. Typically, several kilobases 
of flanking DNA (both at the 5' and 3' ends) are included 
in the vector {see, e.g., Thomas and Capecchi (1987) Cell 
51:503 for a description of homologous recombination 
vectors) . The vector is introduced into an embryonic 

is stem cell line (e.g., by electroporation) and cells in 
which the introduced Tango-77 gene has homologously 
recombined with the endogenous Tango-77 gene are selected 
{see, e.g., Li et al. (1992) Cell 69:915). The selected 
cells are then injected into a blastocyst of an animal 

20 (e.g., a mouse) to form aggregation chimeras {see, e.g., 
Bradley in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed. (IRL, Oxford, 1987) 
pp. 113-152) . A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and 

25 the embryo brought to term. Progeny harboring the 

homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 

30 homologous recombination vectors and homologous 

recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-human 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
5 system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
(1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (0' Gorman et al. 

10 (1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 
animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such 
animals can be provided through the construction of 

is "double" transgenic animals, e.g., by mating two 

transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 

20 described herein can also be produced according to the 
methods described in Wilmut et al. (1997) Nature 385 : 810- 
813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 

25 growth cycle and enter G Q phase. The quiescent cell can 
then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 

30 develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated. 
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IV. Pharmaceutical Compositions 

The Tango-77 nucleic acid molecules, Tango-77 
proteins, and anti-Tango-77 antibodies (also referred to 
herein as "active compounds") of the invention can be 
5 incorporated into pharmaceutical compositions suitable 
for administration. Such compositions typically comprise 
the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is 

10 intended to include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use 
of such media and agents for pharmaceutically active 

is substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

20 A pharmaceutical composition of the invention is 

formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, (e.g. intravenous, intradermal, 
subcutaneous) (e.g., oral inhalation), transdermal 

25 (topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 

30 glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 

35 acetates, citrates or phosphates and agents for the 
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adjustment of tonicity such as sodium chloride or 
dextrose, pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
5 syringes or multiple dose vials made of glass or plastic. 
Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable 

10 solutions or dispersions. For intravenous 

administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF; 
Parsippany, NJ) or phosphate buffered saline (PBS) » In 
all cases, the composition must be sterile and should be 

is fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, 

20 for example, water, ethanol, polyol (for example, 
glycerol, propylene glycol, and liquid polyetheylene 
glycol, and the like), and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by 
the use of a coating such as lecithin, by the maintenance 

25 of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, 
parabens, chlorobutanol , phenol, ascorbic acid, 

30 thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, 
sugars, polyalcohols such as mannitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including 
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in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by- 
incorporating the active compound (e.g., a Tango-77 
5 protein or anti-Tango-77 antibody) in the required amount 
in an appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a 

10 sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and 

is f reeze-drying which yields a powder of the active 

ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 

20 gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 

25 for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutical ly compatible binding agents, 
and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 

30 the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 

35 lubricant such as magnesium stearate or Sterotes; a 
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glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring . 

5 For administration by inhalation, the compounds 

are delivered in the form of an aerosol spray from a 
pressurized container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 
io Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 
is include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 
20 formulated into ointments, salves, gels, or creams as 
generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
25 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 
30 microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
35 those skilled in the art. The materials can also be 
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obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutical^ acceptable carriers. These can be 
prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 
10 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
15 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
20 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors . 
25 Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
{see, e.g., Chen et al . (1994) Proc. Natl. Acad. Sci. USA 
91:3054-3057) . The pharmaceutical preparation of the 
30 gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
35 retroviral vectors, the pharmaceutical preparation can 


WO 99/06426 


PCT/US98/16102 


- 60 - 

include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 

10 assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). A 

15 Tango- 77 protein interacts with other cellular proteins 
and can thus be used for regulation of inflammation. The 
polypeptides of the invention can be used in assays to 
determine biological activity. For example, they could 
be used in a panel of proteins for high- throughput 

20 screening. 

The isolated nucleic acid molecules of the 
invention can be used to express Tango-77 protein (e.g., 
via a recombinant expression vector in a host cell in 
gene therapy applications) , to detect Tango-77 mRNA 

25 (e.g., in a biological sample) or a genetic lesion in a 
Tango-77 gene, and to modulate Tango-77 activity. In 
addition, the Tango-77 proteins can be used to screen 
drugs or compounds which modulate the Tango-77 activity 
or expression as well as to treat disorders characterized 

30 by insufficient or excessive production of Tango-77 
protein or production of Tango-77 protein forms which 
have decreased or aberrant activity compared to Tango-77 
wild type protein. In addition, the anti-Tango-77 
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antibodies of the invention can be used to detect and 
isolate Tango-77 proteins and modulate Tango-77 activity. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
5 uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to 

herein as a "screening assay") for identifying 

modulators, i.e., candidate or test compounds or agents 
10 (e.g., peptides, peptidomimetics, small molecules or 

other drugs) which bind to Tango-77 proteins or have a 

stimulatory or inhibitory effect on, for example, 

Tango-77 expression or Tango-77 activity. 

Examples of methods for the synthesis of molecular 
is libraries can be found in the art, for example in: 

DeWitt et al . (1993) Proc. Natl. Acad. Sci . USA 90:6909; 

Erb et al . (1994) Proc. Natl. Acad. Sci. USA 91:11422; 

Zuckermann et al . (1994). J. Med. Chem. 37:2678; Cho et 

al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
20 Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 

Chem. Int. Ed. Engl. 33:2061; and Gallop et al . (1994) J. 

Med. Chem. 37:1233. 

Libraries of compounds may be presented in 

solution (e.g., Houghten (1992) Bio/Techniques 13:412- 
25 421), or on beads (Lam (1991) Mature 354:82-84), chips 

(Fodor (1993) Nature 364 : 555-556) , bacteria (U.S. Patent 

No. 5,223,409), spores (Patent Nos. 5,571,698; 5,403,484; 

and 5,223,409), plasmids (Cull et al . (1992) Proc. Natl. 

Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
30 (1990) Science 249:386-390; Devlin (1990) Science 

249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 

USA 87:6378-6382; and Felici (1991) J. Mol . Biol. 

222 :301-310) . 


WO 99/06426 


PCT/US98/16102 


- 62 - 

In another embodiment, an assay is used to 
determine the ability of the test compound to modulate 
the activity of Tango-77 or a biologically active portion 
thereof, for example, by determining the ability of the 
5 Tango-77 protein to bind to or interact with a Tango-77 
target molecule. As used herein, a "target molecule" is 
a molecule with which a Tango-77 protein binds or 
interacts in nature, for example, a molecule on the 
surface of a cell. A Tango-77 target molecule can be a 

10 non-Tango- 77 molecule or a Tango-77 protein or 

polypeptide of the present invention. In one embodiment, 
a Tango-77 target molecule is a component of a signal 
transduction pathway, for example, Tango-77 may bind to a 
IL-1 receptor or another receptor thereby blocking the 

is receptor and inhibiting future signal transduction. 

Determining the ability of the Tango-77 protein to bind 
to or interact with a Tango-77 target molecule can be 
accomplished by one of the methods described above. In a 
preferred embodiment, determining the ability of the 

20 Tango-77 protein to bind to or interact with a Tango-77 
target molecule can be accomplished by determining the 
activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the 

25 target (e.g., intracellular Ca 2 \ diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a Tango- 77 -responsive 
regulatory element operably linked to a nucleic acid 

30 encoding a detectable marker, e.g. lucif erase) , or 

detecting a cellular response, for example, inflammation. 

In yet another embodiment, an assay of the present 
invention is a cell-free assay comprising contacting a 
Tango-77 protein or biologically active portion thereof 

35 with a test compound and determining the ability of the 
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test compound to bind to the Tango- 77 protein or 
biologically active portion thereof. Binding of the test 
compound to the Tango- 77 protein can be determined either 
directly or indirectly as described above. In a 
5 preferred embodiment, the assay includes contacting the 
Tango-77 protein or biologically active portion thereof 
with a known compound which binds Tango-77 to form an 
assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test 
10 compound to interact with a Tango-77 protein, wherein 
determining the ability of the test compound to interact 
with a Tango-77 protein comprises determining the ability 
of the test compound to preferentially bind to Tango-77 
or biologically active portion thereof as compared to the 
is known compound. 

In another embodiment, an assay is a cell -free 
assay comprising contacting Tango-77 protein or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
20 modulate (e.g., stimulate or inhibit) the activity of the 
Tango-77 protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of Tango-77 can be accomplished, for 
example, by determining the ability of the Tango-77 
25 protein to bind to a Tango-77 target molecule by one of 
the methods described above for determining direct 
binding. In an alternative embodiment, determining the 
ability of the test compound to modulate the activity of 
Tango-77 can be accomplished by determining the ability 
30 of the Tango-77 protein to further modulate a Tango-77 
target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
35 comprises contacting the Tango-77 protein or biologically 
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active portion thereof with a known compound which binds 
Tango- 77 to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability 
of the test compound to interact with a Tango- 77 protein, 
5 wherein determining the ability of the test compound to 
interact with a Tango- 77 protein comprises determining 
the ability of the Tango- 77 protein to preferentially 
bind to or modulate the activity of a Tango- 77 target 
molecule . 

10 It is possible that membrane -bound forms of Tango- 

77 exist. The cell -free assays of the present invention 
are amenable to use of both the forms Tango-77. In the 
case of cell -free assays comprising a membrane -bound form 
of Tango-77, it may be desirable to utilize a 

is solubilizing agent such that the membrane -bound form of 
Tango- 77 is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
n-octylglucoside , n-dodecylglucoside , n-dodecylmaltoside , 
octanoyl-N-methylglucamide, . decanoyl-N-methylglucamide, 

20 Triton® X-100, Triton® X-114, Thesit®, 

Isotridecypoly (ethylene glycol ether)n, 3- [(3- 
cholamidopropyl) dimethylamminio] -1-propane sulfonate 
(CHAPS) , 3- [ ( 3 -cholamidopropyl) dimethylamminio] -2- 
hydroxy- 1-propane sulfonate (CHAPSO) , or N-dodecyl=N,N- 

25 dimethyl-3-ammonio-l-propane sulfonate. 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either Tango-77 or its target molecule to 
facilitate separation of complexed from uncomplexed forms 

30 of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to 
Tango-77, or interaction of Tango-77 with a target 
molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for 

35 containing the reactants. Examples of such vessels 
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include microtitre plates, test tubes, and micro- 
centrifuge tubes. In one embodiment, a fusion protein 
can be provided which adds a domain that allows one or 
both of the proteins to be bound to a matrix. For 
5 example, glutathione-S-transf erase/ Tango-77 fusion 
proteins or glutathione-S- transf erase/target fusion 
proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical Co.; St. Louis, MO) or glutathione 
derivatized microtitre plates, which are then combined 

io with the test compound or the test compound and either 
the non-adsorbed target protein or Tango- 77 protein, and 
the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for 
salt and pH) . Following incubation, the beads or 

is microtitre plate wells are washed to remove any unbound 
components and complex formation is measured either 
directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the 
matrix, and the level of Tango-77 binding or activity 

20 determined using standard techniques. 

Other techniques for immobilizing proteins on 
matrices can also be used in the screening assays of the 
invention. For example, either Tango-77 or its target 
molecule can be immobilized utilizing conjugation of 

25 biotin and streptavidin. Biotinylated Tango-77 or target 
molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL) , and immobilized in the wells of streptavidin-coated 

30 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with Tango-77 or target molecules but 
which do not interfere with binding of the Tango-77 
protein to its target molecule can be derivatized to the 
wells of the plate, and unbound target or Tango-77 

35 trapped in the wells by antibody conjugation. Methods 
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for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, 
include immunodetection of complexes using antibodies 
reactive with the Tango- 77 or target molecule, as well as 
s enzyme -linked assays which rely on detecting an enzymatic 
activity associated with the Tango-77 or target molecule. 

In another embodiment, modulators of Tango-77 
expression are identified in a method in which a cell is 
contacted with a candidate compound and the expression of 

10 Tango-77 mRNA or protein in the cell is determined. The 
level of expression of Tango-77 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of Tango-77 mRNA or protein in the 
absence of the candidate compound. The candidate 

15 compound can then be identified as a modulator of 
Tango-77 expression based on this comparison. For 
example, when expression of Tango-77 mRNA or protein is 
greater (statistically significantly greater) in the 
presence of the candidate compound than in its absence, 

20 the candidate compound is identified as a stimulator of 
Tango-77 mRNA or protein expression. Alternatively, when 
expression of Tango-77 mRNA or protein is less 
(statistically significantly less) in the presence of the 
candidate compound than in its absence, the candidate 

25 compound is identified as an inhibitor of Tango-77 mRNA 
or protein expression. The level of Tango-77 mRNA or 
protein expression in the cells can be determined by 
methods described herein for detecting Tango-77 mRNA or 
protein. 

30 In yet another aspect of the invention, the 

Tango-77 proteins can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223- 
232; Madura et al . (1993) J. Biol. Chem. 268:12046-12054; 

35 Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 


WO 99/06426 


PCT/US98/16102 


- 67 - 

et al. (1993) Oncogene 8:1693-1696; and PCT Publication 
No. WO 94/10300) , to identify other proteins, which bind 
to or interact with Tango-77 ( "Tango -77 -binding proteins" 
or "Tango- 77 -bp" ) and modulate Tango-77 activity. Such 
5 Tango- 77 -binding proteins are also likely to be involved 
in the propagation of signals by the Tango-77 proteins 
as, for example, upstream or downstream elements of the 
Tango-77 pathway. 

The two-hybrid system is based on the modular 

10 nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
the assay utilizes two different DNA constructs. In one 
construct, the gene that codes for Tango-77 is fused to a 
gene encoding the DNA binding domain of a known 

is transcription factor (e.g., GAL-4) . In the other 
construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" 
or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If 

20 the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an Tango- 77 -dependent complex, the DNA- 
binding and activation domains of the transcription 
factor are brought into close proximity. This proximity 
allows transcription of a reporter gene (e.g., LacZ) 

25 which is operably linked to a transcriptional regulatory 
site responsive to the transcription factor. Expression 
of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes 

30 the protein which interacts with . Tango-77 . 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 
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B. Detection Assays 3 
Portions or fragments of the cDNA sequence 
identified herein (and the corresponding complete gene 
sequences) can be used in numerous ways as polynucleotide 
5 reagents. For example, the sequence can be used to: (i) 
map the respective gene on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 
(tissue typing) ; and (iii) aid in forensic identification 
10 of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
of a gene has been isolated, this sequence can be used to 

is map the location of the gene on a chromosome. 

Accordingly, Tango-77 nucleic acid molecules described 
herein or fragments thereof, can be used to map the 
location of the Tango-77 gene(s) on a chromosome. The 
mapping of the Tango-77 sequences to chromosomes is an 

20 important first step in correlating these sequences with 
genes associated with disease. 

Briefly, a Tango-77 gene can be mapped tc 
chromosomes by preparing PCR primers (preferably 15-2 5 bp 
in length) from the Tango-77 sequences. Computer 

25 analysis of Tango-77 sequences can be used to rapidly 

select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. 
These primers can then be used for PCR screening of 
somatic cell hybrids containing individual human 

30 chromosomes. Only those hybrids containing the human 
gene corresponding to the Tango-77 sequences will yield 
an amplified fragment. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 
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mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 
random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow (because they lack 
a particular enzyme) but in which human cells can, the 
one human chromosome that contains the gene encoding the 
needed enzyme, will be retained. By using various media, 
panels of hybrid cell lines can be established. Each 
cell line in a panel contains either a single human 
chromosome or a small number of human chromosomes, and a 
full set of mouse chromosomes, allowing easy mapping of 
individual genes to specific human chromosomes. 
(D'Eustachio et al . (1983) Science 220:919-924).- Somatic 
cell hybrids containing only fragments of human 
chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PGR mapping of somatic cell hybrids is a rapid 
procedure for assigning a particular sequence to a 
particular chromosome. Three or more sequences can be 
o assigned per day using a single thermal cycler. Using 
the Tango-77 sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies 
which can similarly be used to map a Tango-77 sequence to 
5 its chromosome include in situ hybridization (described 
in Fan et al . (1990) Proc. Natl. Acad. Sci. USA 87:6223- 
27) , pre-screening with labeled flow-sorted chromosomes, 
and pre-selection by hybridization to chromosome specific 
cDNA libraries. 

o Fluorescence in situ hybridization (FISH) of a DNA 

sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical, 

5 e.g., colcemid that disrupts the mitotic spindle. The 
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chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 
develops on each chromosome, so that the chromosomes can 
be identified individually. The FISH technique can be 
5 used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. 
Preferably 1,000 bases, and more preferably 2,000 bases 

10 will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 

is individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 

20 sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal * location, the physical position of the 

25 sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 

30 same chromosomal region, can then be identified through 
linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Nature 
325:783-787. 

Moreover, differences in the DNA sequences between 
35 individuals affected and unaffected with a disease 
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associated with the Tango-77 gene can be determined. If 
a mutation is observed in some or all of the affected 
individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the 
5 particular disease. Comparison of affected and 

unaffected individuals generally involves first looking 
for structural alterations in the chromosomes such as 
deletions or translocations that are visible from 
chromosome spreads or detectable using PGR based on that 
10 DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the 
presence of a mutation and to distinguish mutations from 
polymorphisms . 

2. Tissue Typing 

is The Tango-77 sequences of the present invention 

can also be used to identify individuals from minute 
biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 

20 personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 

25 switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 
U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 

30 invention can be used to provide an alternative technique 
which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
Tango-77 sequences described herein can be used to 
prepare two PCR primers from the 5' and 3' ends of the 
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sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 
individuals, prepared in this manner, can provide unique 
5 individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
individuals and from tissue. The Tango-77 sequences of 
10 the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the 
coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a 
is frequency of about once per each 500 bases. Each of the 
sequences described herein can, to some degree, be used 
as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, 
20 fewer sequences are necessary to differentiate 

individuals. The noncoding sequences of SEQ ID N0:1 can 
comfortably provide positive individual identification 
with a panel of perhaps 10 to 1,000 primers which each 
yield a noncoding amplified sequence of 100 bases. If 
25 predicted coding sequences, such as those in SEQ ID NO: 3, 
SEQ ID NO:6, or SEQ ID NO: 10 are used, a more appropriate 
number of primers for positive individual identification 
would be 500-2, 000 . 

If a panel of reagents from Tango-77 sequences 
30 described herein is used to generate a unique 

identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 
positive identification of the individual, living or 
35 dead, can be made from extremely small tissue samples. 
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3. Use of Partial Tango-77 Sequences in Forensic 

Biology 

DNA-based identification techniques can also be 
used in forensic biology. Forensic biology is a 
5 scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 
make such an identification, PCR technology can be used 
to amplify DNA sequences taken from very small biological 
io samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
standard, thereby allowing identification of the origin 
of the biological sample, 
is The sequences of the present invention can be used 

to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
identifications by, for example, providing another 
20 "identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments . 
25 Sequences targeted to noncoding regions of SEQ ID NO:l 
are particularly appropriate for this use as greater 
numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this 
technique. Examples of polynucleotide reagents include 
30 the Tango-77 sequences or portions thereof, e.g., 

fragments derived from the noncoding regions of SEQ ID 
NO:l having a length of at least 20 or 3 0 bases. 

The Tango-77 sequences described herein can 
further be used to provide polynucleotide reagents, e.g., 
35 labeled or labelable probes which can be used in, for 
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example, an in situ hybridization technique, to identify 
a specific tissue, e.g., brain tissue. This can be very 
useful in cases where a forensic pathologist is presented 
with a tissue of unknown origin. Panels of such Tango-77 

5 probes can be used to identify tissue by species and/or 
by organ type. 

In a similar fashion, these reagents, e.g., 
Tango-77 primers or probes can be used to screen tissue 
culture for contamination (i.e., screen for the presence 

io of a mixture of different types of cells in a culture) . 

C . Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics , and monitoring 

is clinical trails are used for prognostic (predictive) 

purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining Tango-77 protein 
and/or nucleic acid expression as well as Tango-77 

20 activity, in the context of a biological sample (e.g., 
blood, serum, cells, tissue) to thereby determine whether 
an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with 
aberrant Tango-77 expression or activity. The invention 

25 also provides for prognostic (or predictive) assays for 
determining whether an individual is at risk of 
developing a disorder associated with Tango-77 protein, 
nucleic acid expression or activity. For example, 
mutations in a Tango-77 gene can be assayed in a 

30 biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with 
Tango-77 protein, nucleic acid expression or activity. 
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Another aspect of the invention provides methods 
for determining Tango-77 protein, nucleic acid expression 
or Tango-77 activity in an individual to thereby select 
appropriate therapeutic or prophylactic agents for that 
individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g.; the genotype of the individual examined to 
o determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of Tango-77 in 
s clinical trials. 

These and other agents are described in further 
detail in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
o absence of Tango-77 in a biological sample involves 
obtaining a biological sample from a test subject and 
contacting the biological sample with a compound or an 
agent capable of detecting Tango-77 protein or nucleic 
acid (e.g., mRNA, genomic DNA) that encodes Tango-77 
>5 protein such that the presence of Tango-77 is detected in 
the biological sample. A preferred agent for detecting 
Tango-77 mRNA or genomic DNA is a labeled nucleic acid 
probe capable of hybridizing to Tango-77 mRNA or genomic 
DNA. The nucleic acid probe can be, for example, a full- 
30 length Tango-77 nucleic acid, such as the nucleic acid of 
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10 or a 
portion thereof, such as an oligonucleotide of at least 
15, 30, 50, 100, 250 or 500 nucleotides in length and 
sufficient to specifically hybridize under stringent 
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10 


15 


20 


conditions to Tango-77 mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the 
invention are described herein. 

A preferred agent for detecting Tango-77 protein 
is an antibody capable of binding to Tango-77 protein, 
preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof 
(e.g., Fab or F(ab') a ) can be used. The term "labeled", 

. . i- a +-h<a r^r>K<* or* antibodv, is intended to 

encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with 
another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody 
using a f luorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be 
detected with f luorescently labeled streptavidin. The 
term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as 
well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention 
can be used to detect Tango-77 mRNA, protein, or genomic 
5 DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of 
Tango-77 mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of 
Tango-77 protein include enzyme linked immunosorbent 
o assays (ELISAs) , Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of 
Tango-77 genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of Tango-77 
protein include introducing into a subject a labeled 
: 5 anti-Tango-77 antibody. For example, the antibody can be 
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labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging 
techniques 

In one embodiment, the biological sample contains 
5 protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 

10 from a subject. 

In another embodiment, the methods further involve 
obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting Tango-77 protein, mRNA, or 

is genomic DNA, such that the presence of Tango-77 protein, 
mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of Tango-77 protein, mRNA or 
genomic DNA in the control sample with the presence of 
Tango-77 protein, mRNA or genomic DNA in the test sample. 

20 The invention also encompasses kits for detecting 

the presence of Tango-77 in a biological sample (a test 
sample) . Such kits can be used to determine if a subject 
is suffering from or is at increased risk of developing a 
disorder associated with aberrant expression of Tango-77 

25 (e.g., an -immunological disorder) . For example, the kit 
can comprise a labeled compound or agent capable of 
detecting Tango-77 protein or mRNA in a biological sample 
and means for determining the amount of Tango-77 in the 
sample (e.g., an anti-Tango-77 antibody or an 

30 oligonucleotide probe which binds to DNA encoding 
Tango-77, e.g., SEQ ID NO:l or SEQ ID N0:3 or SEQ ID 
N0:6, or SEQ ID NO:10). Kits may also include 
instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 

35 associated with aberrant expression of Tango-77 if the 
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amount of Tango- 77 protein or mRNA is above or below a 
normal level. 

For antibody-baaed kits, the kit may comprise, for 
example: (1) a first antibody (e.g., attached to a solid 

3 support) which binds to Tango-77 protein; and, optionally 
(2) a second, different antibody which binds to Tango-77 
protein or the first antibody and is conjugated to a 
detectable agent. 

For oligonucleotide-based kits, the kit may 

10 comprise, for example: (1) an oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a Tango-77 nucleic acid sequence or (2) a pair of primers 
useful for amplifying a Tango-77 nucleic acid molecule; 
The kit may also comprise, e.g., a buffering 

is agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 
substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 

20 compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 
single package along with instructions for observing 
whether the tested subject is suffering from or is at 

25 risk of developing a disorder associated with aberrant 
expression of Tango-77. 

2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
30 subjects having or at risk of developing a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. For example, the assays described herein, such 
as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or 
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at risk of developing a disorder associated with aberrant 
expression or activity. Thus, the present invention 
provides a method in which a test sample is obtained from 
a subject and Tango-77 protein or nucleic acid (e.g., 
5 mRNA, genomic DNA) is detected, wherein the presence of 
Tango- 77 protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or 
disorder associated with aberrant Tango- 77 expression or 
activity. As used herein, a "test sample" refers to a 

10 biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described 
herein can be used to determine whether a subject can be 

is administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant Tango- 77 expression or 
activity. For example, such methods can be used to 

20 determine whether a subject can be effectively treated 
with a specific agent or class of agents (e.g., agents of 
a type which decrease Tango- 77 activity) . Thus, the 
present invention provides methods for determining 
whether a subject can be effectively treated with an 

25 agent for a disorder associated with aberrant Tango-77 
expression or activity in which a test sample is obtained 
and Tango-77 protein or nucleic acid is detected (e.g., 
wherein the presence of Tango-77 protein or nucleic acid 
is diagnostic for a subject that can be administered the 

30 agent to treat a disorder associated with aberrant 
Tango-77 expression or activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a Tango-77 gene, 
thereby determining if a subject with the lesioned gene 

35 is at risk for a disorder characterized by aberrant 
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inflammation. In preferred embodiments, the methods 
include detecting, in a sample of cells from the subject, 
the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting 

5 the integrity of a gene encoding a Tango-77-protein, or 
the mis -express ion of the Tango- 77 gene. For example, 
such genetic lesions or mutations can be detected by 
ascertaining the existence of at least one of: 1) a 
deletion of one or more nucleotides from a Tango- 77 gene; 

io 2) an addition of one or more nucleotides to a Tango-77 
gene; 3) a substitution of one or more nucleotides of a 
Tango-77 gene; 4) a chromosomal rearrangement of a 
Tango-77 gene; 5) an alteration in the level of a 
messenger RNA transcript of a Tango-77 gene; 6) an 

is aberrant modification of a Tango-77 gene, such as of the 
methylation pattern of the genomic DNA; 7) the presence 
of a non-wild type splicing pattern of a messenger RNA 
transcript of a Tango-77 gene; 8) a non-wild type level 
of a Tango-77-protein; 9) an allelic loss of a Tango-77 

20 gene, and 10) an inappropriate post-translational 

modification of a Tango- 77 -protein. As described herein, 
there are a large number of assay techniques known in the 
art which can be used for detecting lesions or mutations 
in a Tango-77 gene. A preferred biological sample is a 

25 peripheral blood leukocyte sample isolated by 
conventional means from a subject. 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
reaction (PGR) {see, e.g., U.S. Patent Nos. 4,683,195 and 

30 4,683,202), such as anchor PCR or RACE PCR, or, 

alternatively, in a ligation chain reaction (LCR) {see, 
e.g., Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc. Natl. Acad. Sci. USA 91:360- 
364) , the latter of which can be particularly useful for 

35 detecting point mutations in the Tango-77-gene {see, 
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e.g., Abravaya et al . (1995) Nucleic Acids Res. 23:675- 
682) . This method can include the steps of collecting a 
sample of cells from a patient, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the 

5 sample, contacting the nucleic acid sample with one or 
more primers which specifically hybridize to a Tango-77 
gene under conditions such that hybridization and 
amplification of the Tango-77-gene (if present) occurs, 
and detecting the presence or absence of an amplification 

10 product, or detecting the size of the amplification 

product and comparing the length to a control sample. It 
is anticipated that PCR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 

is described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al. (1990) 
Proc. Natl. Acad. Sci . C7SA 87 : 1874-1878) , transcriptional 
amplification system (Kwoh, et al . (1989) Proc, Natl. 

20 Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 

25 schemes are especially useful for the detection of 

nucleic acid molecules if such molecules are present in 
very low numbers. 

In an alternative embodiment, mutations in a 
Tango-77 gene from a sample cell can be identified by 

30 alterations in restriction enzyme cleavage patterns. For 
example, sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 

35 fragment length sizes between sample and control DNA 
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indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,498,531) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme 
5 cleavage site. 

In other embodiments, genetic mutations in 
Tango- 77 can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of 
10 oligonucleotides probes (Cronin et al • (1996) Human 

Mutation 7:244-255; Kozal et al . (1996) Nature Medicine 
2:753-759) . For example, genetic mutations in Tango-77 
can be identified in two-dimensional arrays containing 
light -generated DNA probes as described in Cronin et al . 
is supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the 
sequences by making linear arrays of sequential 
overlapping probes. This step allows the identification 
20 of point mutations. This step is followed by a second 
hybridization array that allows the characterization of 
specific mutations by using smaller, specialized probe 
arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel 
25 probe sets, one complementary to the wild-type gene and 
the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the Tango-77 gene and detect mutations 
30 by comparing the sequence of the sample Tango-77 with the 
corresponding wild-type (control) sequence. Examples of 
sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc. Natl. Acad. 
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. 
35 Sci. USA 74:5463). It is also contemplated that any of a 
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variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ( (1995) 
Bio/Techniques 19:448), including sequencing by mass 
spectrometry (see, e.g., PCT Publication No. WO 94/16101; 
Cohen et al . (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147- 
159) . 

Other methods for detecting mutations in the 
Tango-77 gene include methods in which protection from 
cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242). In general, the technique of 
"mismatch cleavage" entails providing heteroduplexes 
formed by hybridizing (labeled) RNA or DNA containing the 
wild-type Tango-77 sequence with potentially mutant RNA 
or DNA obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which cleaves 
single- stranded regions of the duplex such as which will 
exist due to basepair mismatches between the control and 
sample strands. RNA/DNA duplexes can be treated with 
RNase to digest mismatched regions, and DNA/DNA hybrids 
can be treated with SI nuclease to digest mismatched 
regions. In other embodiments, either DNA/DNA or RNA/DNA 
duplexes can be treated with hydroxylamine or osmium 
tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size 
on denaturing polyacrylamide gels to determine the site 
of mutation. See, e.g., Cotton et al . (1988) Proc. Natl. 
Acad. Sci. USA 85:4397; Saleeba et al . (1992) Methods 
Enzymol. 217:286-295. In a preferred embodiment, the 
control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
mismatched base pairs in double -stranded DNA (so called 
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W DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in Tango-77 cDNAs 
obtained from samples of cells. For example, the mutY 
enzyme of E. coli cleaves A at G/A mismatches and the 
5 thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al . (1994) Carcinogenesis 15:1657- 
1662) . According to an exemplary embodiment, a probe 
based on a Tango-77 sequence, e.g., a wild-type Tango-77 
sequence, is hybridized to a cDNA or other DNA product 

io from a test celi(s). The duplex is treated with a DNA 
mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or 
the like. See, e.g., U.S. Patent No. 5,459,039. 
In other embodiments, alterations in 

15 electrophoretic mobility will be used to identify 

mutations in Tango-77 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant 
and wild type nucleic acids (Orita et al. (1989) Proc. 

20 Natl. Acad. Sci. USA 86:2766; see also Cotton (1993) 

Mutat. Res. 285:125-144; Hayashi (1992) Genet Anal Tech 
Appl 9:73-79) . Single- stranded DNA fragments of sample 
and control Tango-77 nucleic acids will be denatured and 
allowed to renature. The secondary structure of single- 

25 stranded nucleic acids varies according to sequence, and 
the resulting alteration in electrophoretic mobility 
enables the detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by 

30 using RNA (rather than DNA) , in which the secondary 

structure is more sensitive to a change in sequence. In 
a preferred embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in 
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electrophoretic mobility (Keen et al . (1991) Trends Genet 
7:5) . 

In yet another embodiment, the movement of mutant 
or wild- type fragments in polyacrylamide gels containing 
5 a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Mature 313:495). When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 

10 approximately 40 bp of high-melting GC-rich DNA by PCR. 
in a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys. Chem. 265:12753). 

is Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

20 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl. Acad. 
Sci. USA 86:6230). Such allele specific oligonucleotides 

25 are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification 

30 technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

35 differential hybridization) (Gibbs et al . (1989) Nucleic 
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Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Proasner (1993) 
Tibtech 11:238). In addition, it may be desirable to 
5 introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

10 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

is amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

20 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a Tango- 77 gene. 

Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which Tango- 77 is 

25 expressed may be utilized in the prognostic assays 
described herein. 

3 . Pharmacogenomics 

Agents, or modulators which have a stimulatory or 
30 inhibitory effect on Tango-77 activity (e.g., Tango-77 
gene expression) as identified by a screening assay 
described herein can be administered to individuals to 
treat (prophylactically or therapeutically) disorders 
(e.g., acute or chronic inflammation and asthma) 
35 associated with aberrant Tango-77 activity. In 
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conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 
5 considered. Differences in metabolism of therapeutics 
can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 

io selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 

is Accordingly, the activity of Tango-77 protein, expression 
of Tango-77 nucleic acid, or mutation content of Tango-77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. 

20 Pharmacogenomics deals with clinically significant 

hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 
43 (2) :254-266. In general, two types of pharmacogenetic 

25 conditions can be differentiated. Genetic conditions 

transmitted as a single factor altering the way drugs act 
on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering 
the way the body acts on drugs are referred to as 

30 "altered drug metabolism". These pharmacogenetic 
conditions can occur either as rare defects or as 
polymorphisms. For example, glucose -6 -phosphate 
dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is 

35 haemolysis after ingestion of oxidant drugs (anti- 
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malar ials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 
5 the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 

io effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 

15 different among different populations. For example, the 
gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 

20 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, PM shows no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6- formed metabolite morphine. The other 

25 extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of Tango- 77 protein, expression 

30 of Tango-77 nucleic acid, or mutation content of Tango-77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 

35 of polymorphic alleles encoding drug-metabolizing enzymes 
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to the identification of an individual's drug 
responsiveness phenotype. This knowledge, when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
s prophylactic efficiency when treating a subject with a 
Tango-77 modulator, such as a modulator identified by one 
of the exemplary screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 

10 compounds) on the expression or activity of Tango-77 

(e.g., the ability to modulate aberrant inflammation) can 
be applied not only in basic drug screening, but also in 
clinical trials. For example, the effectiveness of an 
agent, as determined by a screening assay as described 

is herein, to increase Tango-77 gene expression, increase 
protein levels, or upregulate Tango-77 activity, can be 
monitored in clinical trials of subjects exhibiting 
decreased Tango-77 gene expression, decreased protein 
levels, or downregulated Tango-77 activity. 

20 Alternatively, the effectiveness of an agent, as 

determined by a screening assay, to decrease Tango-77 
gene expression, decrease protein levels, or downregulate 
Tango-77 activity, can be monitored in clinical trials of 
subjects exhibiting increased Tango-77 gene expression, 

25 increased protein levels, or upregtilated Tango-77 
activity. 

For example, and not by way of limitation, genes, 
including Tango-77, that are modulated in cells by 
treatment with an agent (e.g., compound, drug or small 
30 molecule) which modulates Tango-77 activity (e.g., as 
identified in a screening assay described herein) can be 
identified. Thus, to study the effect of agents on 
cellular proliferation disorders, for example, in a 
clinical trial, cells can be isolated and RNA prepared 
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and analyzed for the levels of expression of Tango- 77 and 
other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
5 described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 
described herein, or by measuring the levels of activity 
of Tango-77 or other genes. In this way, the gene 
expression pattern can serve as a marker, indicative of 

10 the physiological response of the cells to the agent. 
Accordingly, this response state may be determined 
before, and at various points during, treatment of the 
individual with the agent. 

In a preferred embodiment, the present invention 

is provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 
acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 

20 steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a Tango-77 protein, 
mRNA, or genomic DNA in the preadministration sample; 
(iii) obtaining one or more post -administration samples 

25 from the subject; (iv) detecting the level of expression 
or activity of the Tango-77 protein, mRNA, or genomic DNA 
in the post-administration samples; (v) comparing the 
level of expression or activity of the Tango-77 protein, 
mRNA, or genomic DNA in the pre-administration sample 

30 with the Tango-77 protein, mRNA, or genomic DNA in the 
post administration sample or samples; and (vi) altering 
the administration of the agent to the subject 
accordingly. For example, increased administration of 
the agent may be desirable to increase the expression or 

35 activity of Tango-77 to higher levels than detected, 
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i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may 
be desirable to decrease expression or activity of 
Tango-77 to lower levels than detected, i.e., to decrease 
5 the effectiveness of the agent. 

C. Methods of Treatment 
The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) developing or 

10 having a disorder associated with aberrant Tango-77 
expression or activity. Alternatively, disorders 
associated with aberrant IL-1 production can be treated 
with Tango-77. Such disorders include acute and chronic 
inflammation, asthma, some classes of arthritis, 

is autoimmune diabetes, systemic lupus erythematosus and 
inflammatory bowel disease. 

1 . Prophylactic Methods 
In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 

20 associated with an aberrant Tango-77 expression or 

activity (or aberrant IL-1 expression or activity) , by 
administering to the subject an agent which modulates 
Tango-77 expression or at least one Tango-77 activity. 
Subjects at risk for a disease which is caused or 

25 contributed to by aberrant Tango-77 expression or 
activity can be identified by, for example, any or a 
combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent 
can occur prior to the manifestation of symptoms 

30 characteristic of the Tango-77 aberrancy, such that a 
disease or disorder is prevented or, alternatively, 
delayed in its progression. Depending on the type of 
Tango-77 aberrancy, for example, a Tango-77 agonist or 
Tango-77 antagonist agent can be used for treating the 
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subject. The appropriate agent can be determined based 
on screening assays described herein. 


2 . Therapeutic Methods 

Another aspect of the invention pertains to 
5 methods of modulating Tango- 77 expression or activity for 
therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that 
modulates one or more of the activities of Tango-77 
protein activity associated with the cell. An agent that 

10 modulates Tango-77 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of a Tango-77 protein, 
a peptide, a Tango-77 peptidomimetic, or other small 
molecule. In one embodiment, the agent stimulates one or 

is more of the biological activities of Tango-77 protein. 
Examples of such stimulatory agents include active 
Tango-77 protein and a nucleic acid molecule encoding 
Tango-77 that has been introduced into the cell. In 
another embodiment, the agent inhibits one or more of the 

20 biological activities of Tango-77 protein. Examples of 
such inhibitory agents include antisense Tango-77 nucleic 
acid molecules and anti -Tango-77 antibodies. These 
modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in 

25 vivo (e.g, by administering the agent to a subject). As 
such, the present invention provides methods of treating 
an individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of a 
Tango-77 protein or nucleic acid molecule. In one 

30 embodiment, the method involves administering an agent 
(e.g., an agent identified by a screening assay described 
herein), or combination of agents that modulates (e.g., 
upregulates or downregulates) Tango-77 expression or 
activity. In another embodiment, the method involves 
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administering a Tango- 77 protein or nucleic acid molecule 
as therapy to compensate for reduced or aberrant Tango- 7 7 
expression or activity. 

Stimulation of Tango-77 activity is desirable in 
5 situations in which Tango-77 is abnormally downregulated 
and/or in which increased Tango-77 activity is likely to 
have a beneficial effect. Conversely, inhibition of 
Tango-77 activity is desirable in situations in which 
Tango-77 is abnormally upregulated and/or in which 

10 decreased Tango-77 activity is likely to have a 
beneficial effect. 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

is published patent applications cited throughout this 
application are hereby incorporated by reference. 

EXAMPLES 

Example 1: Isolation and Characterization of Human 
Tango-77 cDNAs 

20 Cytokine genes IL-la, IL-l/J and IL-lra have been 

found to be closely clustered on chromosome 2, i.e., 
IL-la, IL-1/3 and IL-lra are located within 450 kb of each 
other. BAC clones containing IL-la and IL-l/? were used 
to identify other proximal unknown cytokine genes. To do 

25 this, a BAC clone containing IL-la and IL-1/3 was selected 
from a BAC library (Research Genetics, Huntsville, 
Alabama) using specific primers designed against IL-la 
and IL-10. The DNA from the BAC was extracted and used 
to make a random- sheared genomic library. From this BAC 

30 library, 4000 clones were selected for sequencing. The 
resulting genomic sequences were then assembled into 
contigs and used to screen proprietary and public data 
bases. One genomic contig was found to contain two 
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segments of sequences which resemble IL-lra. These two 
segments are potential exons of Tango- 77 gene. 

Two PCR primers were then designed from the two 
potential exons and used to screen a panel of cDNA 

5 libraries for the expression of a Tango-77 message. A 
cDNA library from TNF-a treated human lung epithelia 
showed a positive band of the predicted size (i.e., if 
the two exons are spliced together) . Using the PCR 
fragment as a probe, a single cDNA clone was isolated 

10 from the same library. This cDNA contains an insert of 
989 bp. The cDNA clone contains three possible open 
reading frames. The first open reading frame encompasses 
534 nucleotides (nucleotides 3 56-88 9 of SEQ ID NO:l; SEQ 
ID NO: 3) and encodes a 178 amino acid protein (SEQ ID 

15 NO: 2) . This protein may include a predicted signal 
sequence of about 63 amino acids (from amino acid 1 to 
about amino acid 63 of SEQ ID NO:2 (SEQ ID NO:4)) and a 
predicted mature protein of about 115 amino acids (from 
about amino acid 64 to amino acid 178 of SEQ ID NO: 2 (SEQ 

20 ID NO:5) ) . 

The second putative nucleotide open reading frame 
encompasses 498 nucleotides (nucleotides 389-889 of SEQ 
ID NO:l; SEQ ID NO: 6) and encodes a 167 amino acid 
protein (SEQ ID N0:7) . This protein includes a predicted 

25 signal sequence of about 52 amino acids (from amino acid 
1 to about amino acid 52 of SEQ ID NO: 7 (SEQ ID NO: 8)) 
and a predicted mature protein of about 115 amino acids 
(from about amino acid 53 to amino acid 167 of SEQ ID 
NO: 7 (SEQ ID NO: 9) ) . 

30 The third open reading frame (nucleotides 372-889 

of SEQ ID NO:l; SEQ ID NO: 10) encompasses 408 nucleotides 
and encodes a 136 amino acid protein (SEQ ID NO: 11) . 
This protein includes a predicted signal sequence of 
about 21 amino acids (from amino acid 1 to about amino 

35 acid 21 of SEQ ID NO-.ll (SEQ ID N0:12)) and a predicted 
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mature protein of about 115 amino acids (from about amino 
acid 22 to amino acid 136 of SEQ ID NO: 11 (SEQ ID 
NO: 13) ) . 

Tango-77 is predicted to be 35% identical to human 
IL-lra at the amino acid level. 

Example 2: Expression of Tanan-77 mRNA in H uman Ti wa ,,»« 

The expression of Tango-77 was analyzed using 
Northern blot hybridization. A PCR generated 989 bp 
Tango-77 product was radioactively labeled with 32 P-dCTP 
using the Prime-It kit (Stratagene; La Jolla, CA) 
according to the instructions of the supplier. Filters 
containing human mRNA (MTNI and MTNII: Clontech; Palo 
Alto, CA) were probed in ExpressHyb hybridization 
solution (Clontech) and washed at high stringency 
according to manufacturer's recommendations. 

Tango-77 mRNA was not detected in any unstimulated 
tissues (brain, liver, spleen, skeletal muscle, testis, 
pancreas, heart, kidney and peripheral blood leukocytes) 
mRNA on Clontech Northern blots. 

Over 96 cDNA libraries were then tested for the 
presence of Tango-77 using PCR amplification. Only three 
libraries displayed a positive signal. These libraries 
were the TNFa-treated bronchoepithelium, TNFa-treated SSC 
cell line and ant i-CD3- treated. T cells. 

Example 3: Characte rization of Tanao-77 Protein^ 

In this example, the predicted amino acid sequence 
of human Tango-77 protein was compared to the amino acid 
sequence of known protein IL-lra. In addition, the 
molecular weight of the human Tango-77 proteins was 
predicted. 

The human Tango-77 cDNA (Figure 1; SEQ ID NO:l) 
isolated as described above encodes a 178 amino acid 
protein (Figure 1; SEQ ID NO:2) or a 167 amino acid 
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protein (Figure 1; SEQ ID NO: 7) or a 136 amino acid 
protein (Figure 1; SEQ ID NO: 11). The signal peptide 
prediction program SIGNALP Optimized Tool (Nielsen et al. 
(1997) Protein Engineering 10:1-6) predicted that 
5 Tango-77 includes a 63 amino acid signal peptide (amino 
acid 1 to about amino acid 63 of SEQ ID NO: 2 (SEQ ID 
NO:4)) preceding the 115 mature protein; or preceding the 
115 mature protein (about amino acid 52 to amino acid 167 
of SEQ ID NO:7 (SEQ ID NO:8)); or preceding the 115 

10 mature protein (about amino acid 21 to amino acid 136 of 
SEQ ID NO:ll;SEQ ID NO:12) . 

As shown in Figure 2, Tango-77 has a region of 
homology to IL-lra (SEQ ID NO: 14) . 

Mature Tango-77 has a predicted MW of about 13 kDa 

is and the predicted MW for the immature Tango-77 is 19.6 
kDa, 18.5 kDa or 15.2 kDa, not including post- 
translational modifications. 

Example 4: Preparation of Tanoo-77 Proteins 

Recombinant Tango-77 can be produced in a variety 

20 of expression systems. For example, the mature Tango-77 
peptide can be expressed as a recombinant glutathione- S- 
transf erase (GST) fusion protein in E. coli and the 
fusion protein can be isolated and characterized. 
Specifically, as described above, Tango-77 can be fused 

25 to GST and this fusion protein can be expressed in E. 
coli strain PEB199. ExpiFession of the GST-Tango-77 
fusion protein in PEB199 can be induced with IPTG. The 
recombinant fusion protein can be purified from crude 
bacterial lysates of the induced PEB199 strain by 

30 affinity chromatography on glutathione beads. 
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Example 5: Alternatively spliced forma of IL-lra and 
Tanao- 77 

Computer program Procrustes (Gelfand et al . , 1996, 
Proc. Natl. Acad. Sci. USA, 93:9061-9066) is an alignment 
5 algorithm that predicts the presence of alternatively- 
spliced exons for a protein of interest in a stretch of 
genomic DNA. Using the IL-lra sequence, Proscustes was 
used to search for the presence of additional sequences 
that might encode for alternatively spliced forms of IL- 

10 Ira in the two overlapping BAC genomic sequences (see 
Fig. 3 and Fig. 4) . Potential sequences that encode 
variant exons for IL-lra were identified. These 
predicted exons aligned well with the N-terminal region 
of IL-lra, but were not present in Tango-77. The results 

15 from Procrustes predicts the existence of more spliced 
forms of IL-lra. 

Furthermore, Procrustes also predicted an 
additional sequence in BAC1 and BAC2 that encodes an 
alternatively spliced exon for Tango-77 (T77-procrustes ; 

20 Fig. 5). This predicted splice variant form of Tango-77, 
T77-procrustes, was aligned with Tango-77 (Fig. 6) and 
with IL-lra and IL-1/3 (Fig. 7) . 

PCR primers within this sequence can be used to 
generate a product that can be used to screen a panel of 

25 cDNA libraries using standard techniques. Suitable cDNA 
libraries include libraries made from TNFa- treated 
bronchoepithelium, TNFa- treated SSC cell line and anti- 
CD3 -treated T cells. The resulting cDNA clone (s) can be 
isolated from the library and sequenced to identify 

30 additional Tango-77 cDNAs. 
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Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid molecule comprising a 
nucleotide sequence which is at least 45% identical to 
the nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ 
ID NO: 6, SEQ ID NO: 10, the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, or a 
complement thereof; 

b) a nucleic acid molecule comprising a fragment 
of at least 300 nucleotides of the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, the 
cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807, or a complement thereof; 

c) nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807; 

d) a nucleic acid molecule which encodes a 
fragment of a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID 

NO: 12, SEQ ID NO: 13, wherein the fragment comprises at 
least 15 contiguous amino acids of SEQ ID NO: 2, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO: 12, SEQ ID NO:13, or the 
polypeptide encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807; and 

e) a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2, SEQ ID 
NO:4, SEQ ID N0:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
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SEQ ID NO: 11/ SEQ ID NO: 12, SEQ ID NO: 13, or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the nucleic acid molecule hybridizes to a nucleic acid 
5 molecule comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10, or the complement thereof under 
stringent conditions. 

2. The isolated nucleic acid molecule of claim 
1, which is selected from the group consisting of: 

io a) a nucleic acid comprising the nucleotide 

sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO: 6, or SEQ 
ID NO: 10 or the cDNA insert of the plasmid deposited with 
ATCC as Accession Number 98807, or a complement thereof; 
and 

is b) a nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO : 4 , SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO : 8 , 
SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or 
an amino acid sequence encoded by the cDNA insert of the 

20 plasmid deposited with ATCC as Accession Number 98807. 

3 . The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4 . The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 

25 polypeptide. 

5 . A host cell containing the nucleic acid 
molecule of claim 1. 

6. The host cell of claim 5 which is a mammalian 
host cell. 
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7. A non-human mammalian host cell containing 
the nucleic acid molecule of claim 1. 

8. An isolated polypeptide selected from the 
group consisting of: 

5 a) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO:2, SEQ ID NO: 4 , SEQ ID 
NO:5, SEQ ID NO : 7 , SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
10 NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO: 9, SEQ ID NO:ll, SEQ ID NO: 12, or SEQ ID NO: 13. 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 

is SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 

20 comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10 or the complement thereof under stringent 
conditions ; 

c) a polypeptide which is encoded by a nucleic 
acid molecule comprising a nucleotide sequence which is 

25 at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, or SEQ ID NO: 10. 

9. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2, SEQ ID 

30 NO:4, SEQ ID NO:5, SEQ ID N0:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807. 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

11. An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12 . A method for producing a polypeptide selected 

from the group consisting of: 

a) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO:7, SEQ ID N0:8, SEQ ID N0:9, SEQ ID NO:ll # SEQ ID 

10 NO: 12, SEQ ID NO: 13, or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807; 

b) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 

15 NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an amino acid 
sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the fragment comprises at least 15 contiguous amino acids 

20 of SEQ ID NO:2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, 

SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ 
ID NO: 13, or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number 98807; and 

25 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or 
an amino acid sequence encoded by the cDNA insert of the 

30 plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid sequence of 
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SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO : 6 , or SEQ ID NO: 10 
under stringent conditions; 

comprising culturing the host cell of claim 5 
under conditions in which the nucleic acid molecule is 
5 expressed. 

13 . A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 

10 b) determining whether the compound binds to the 

polypeptide in the sample. 

14 . The method of claim 13 , wherein the compound 
which binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively 
is binds to a polypeptide of claim 8 and instructions for 

use. 

16. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of: 

20 a) contacting the sample with a nucleic acid 

probe or primer which selectively hybridizes to the 
nucleic acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

25 17. The method of claim 16, wherein the sample 

comprises mRNA molecules and is contacted with a nucleic 
acid probe. 
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18. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 

19. A method for identifying a compound which 
s binds to a polypeptide of claim 8 comprising the steps 

of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 8 with a test compound; 
and 

io b) determining whether the polypeptide binds to 

the test compound. 

20. The method of claim 19, wherein the binding 
of the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

is a) detection of binding by direct detecting of 

test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; and 

c) detection of binding using an assay for 
20 Tango- 77 -mediated signal transductions 

21. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 

25 sufficient concentration to modulate the activity of the 
polypeptide . 


WO 99/06426 


PCT/US98/16102 


- 105 - 

22. A method for identifying a compound which 
modulates the activity of a polypeptide of claim 8, 
comprising: 

a) contacting a polypeptide of claim 8 with a 
5 test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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TAAGTCTGCC7TGGCAGGCACTTGCAG * uTTTGAAAGAATCAGATATATC 

AAATrTGTAGTTTAAAATATTTAAGGGAACTCAATTAACTATGCTAGAAA 

AGAGAATTAAGTATTTAGGAGGATTTAATATGGTGTGAAAGTTGTGAAAA 

TCAAAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAACCAGGG 

AAAGGCATGAAGATAGAG7TCTCACACTTGTATCCCTGATCATGAAAAAG 

ATCTGC 

>Concig5 

GGGTTTTTCCGCGTTTTTACCCGAAATCTTCAAGGGATGGGAAAAAGAAA 

ATTGCTAAAAAATCTCGGTTTTTTGGTTTTAACAGATATTTACACCNTGG 

ATCCCATTTATTATGTTGTCCCCAAGGTTTTCGGTGGGTTCCCAATCAGT 

TAGCCCCCCTCCACAGTGAAAGCACTTTACTTTATCACCTTCACCTAAAG 

CATAAAATCCAGCTCTTGAAAGCTGCTCCTTGTTAACTGAATATATCCAC 

ATCCCAAAAGTAATGATCCATGCTTCATAATCTGCCACGGATGGATGGAT 

GGATGGATGGATGGATGGATGGATGGATGAATGGATGGATTGATTTCTTG 

GAGGATTTGTTGAATTTGGGAAATTCCACGCCAGGACAGCTGGCCCAAAC 

TGCCCGCGACAATCTGCTCGGTACAAGGGGAGGGTCCTGGAGAGGGTGCG 

GCCCGAGCCCCAGTTTGGAAATGCCAACTTGGCTCTGCAGCCGGGCCTTA 

GCCACTTGGGTCTGGCGTCCCTCCATTATTAGCGCCATGCCGGCTCGGGG 

TGCTGCCAAGTCCCTGAGAGCACAAGCC 

>Concig6 

CGCGCTCAAGAAAAGCTGAAGTGTGAATGTTCTGTC7ACCTTCACAGTAA 

ATGCTAAGAGAATGACCCAAGAGCAGAGGGTATCACTCTGCTACGGAGGA 

TTGATTG7AACTGGCTCTCCTGCCTTAGCAAGAAATGCCAGAACCATGG7 

CATTCAAGTTCTTGACCAAAAACTGCCTTCATGAGAATCAACTTCCCCAA 

GAAAAAAAAAGCAGAAACAGGCAAAGCTTCCAGCATGGTAGGTAATACTG 

ACCCTTCTTCCCTCCTTCCTTTGGAGATTCACACAGTAATAATGCATAAA 

GCTTTGCCAATGGACTAAGCACTGCCCAGGGGTTTTTGTCATGCCTGGAC 

TGAAATGCTCTTTTTGCGTTATCATAGAATCCCAGTGCAGTCTGAGTAGA 

CTCTAAGCAAAAGGGACATTTTTCAAAAAGGCTTTAAATTGCTAGTACAA 

AGAAGGCAACAAA ACTT GCGTAACTGTGGACAGATTAACTCAC TTGG TGT 

TTTGGCTCTTCAGTITTCCCTTGGCTGCGAAGTACTCCTGAAGCTTTCTC 

TGCGGCTCTTCCTGCAAGCAGGCAAGCAAAAAAACGACTGAACTTTATTT 

CGAGAT 

>Concig7 

GAAGAGCCGCTAACTTGCTGTAGTGATAAGGAATGAACTAAGGCTAGGGA 

CATATTAACATCCGCTGGTGGTGACTCTTTAGCCTAGATCTTACCCCACT 

C CTGCT CCT7CCATATGGTTCGGTCTCAGGCTCACTACCGATCAATGGCG 

7AC7AAAAGCAC7AAC7A7AGAC7CCAACACG7C7G7CG7G7G777CACG 

ACAAGCCGTGGAGTTAATCCCTCTGACAGTAGCTCAGATAAGGATGGGCT 

ATCATGGGCCCGGAACTGGGGCATGACGCTCGTCACCAACGCATGAGCTC 

CCCAAGTATGCTATACCTGTCCCTATGAAGGGCTTCCAACTCTATGTGCA 

GTCCCCATGTGGAGAGTCAGGTATTGATTGATCAAGCCAGGGGTGTGGTG 

AATGGGGAGCTTCCTACAGGGGTAATGATAATTGAAATGCACGGTGATGG 

GGATTTTCATATTGGTCTCCTAAGGAGATAACAGATTGGATGCGGGGTCG 

ATATTCCACTGCCCAGGGTGTGTACCGAC3GGTATCTGCAGGTGGATCTCC 

TCCCCACGTTTGATTAATACTCCTGTCTTGGGAAGCATAGACGGGCGGGG 

GAAATGATGAAGGGTGACCACTCCCC 

>Contig8 

GGGAACGCAGTGCTCTGTACGATGGCCTTGATTGCGAATTCCTGCAGGGG 
GGG 

>Contig9 

GGCAAGAGATTTAATATTCATTCCATCTTCATTTGGAAGATGAAAAATTG 
GGGACCAGAGAGGGGAGGGGACTGGGCCAAGTTTTCAAAGAAAAGTCAGT 
AGGAATTGTGAATTCCTGGGGGCCGGGGCCCATTAGTGCTGTTTTGGATC 
AGTAAATGGAGATGTGAGTTTCAACAGTAACAGGGACATTTTAAAATTAA 
AATGATTTAACCTTTAGAAAATGTCCTATTTTGTAATAATGATGGATTCA 
CAGGAAGGTACAAAGAAATGTCCAGAGAGTTCNTGAGCCCCCTTCAGCCA 
GCTTCTTCCAATGTTAACATCTTGCATTATTATAGTACAACATCAAAACT 
GGGAAATCGATATTGGTACTGTCCAGATAGCTTACTCAGATTTTGCCAGT 
7A7AC77CCAC7CA777G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G 
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TG7G7G7AGC7C7A7GCr*A7777A7Gl . *GTAGCTTCATGTAAfi"CS3Ct*J 
AA7CACAA7AC77AAC7A7GCCC7CA7CACAAGAC7C7C7C77GC7A7GC 
TTTACAGCTGTATCCTCTTCATCTCCAAACCCTAAGCCCACCTCACCGCC 
TCCACCATCTCTAATCCCTGGCAACCACTATTCTGTGCTCCATCTCTGTA 
A77AA77G7G77AA77AA7G77A7ACAAA7GGAA7CA7GAAG7A7G7G7C 
CTTTGAGATTGGGCTGTTAATTTTTCACTCAGCACAATTTCCGTGAGTCT 
AA7C CAAC TTGTGTGTAGCAGTAATTCTTTC CTTATTATTGCTGAATAAT 
ATGCCATGGTATGGATGTATCACAGTGTGTCTAATCCTTTGCCCATTGAA 
AGGAATTTGGATAATTTCCAGGTTTTGGCTATTATGAATAAAGTGAACAT 
AAGACATGTGTGTACAAATTTTGGTGTGATCAAAAGTCTCATTTCTCTGG 

GATAAATGCCCGGTAATGAAATGGCTGGGTTGTGTGGG 
>ConciglO 

GCAAGAACACAGGCGCGTATTATAACCTTACTACCAAGACCTGAACCCAT 

ATAAAGGTTTATGCGTAACAATCATCATCCCTGTTCCAGAAGATTACACG 

TACGACCACGCCTGGCTCACCGACTCACGTGGGCCAGTACCAGAAATTCT 

CCCAAACAAA CAGT CGTGTCTGAAAACAATCGCGGTGACCTgCACGGTTA 

GAAAAGCCTGTTTTCAAGTCCTGGAATTGCCACATATTAGCTGGGTAACT 

TTGGGCATCACATTTACTCTCTCCGAATTTCAGATTGCAAAAACTCATTG 

GATTGTTTTGTGGATTGAAAGAAATAATGTAAATTTAGGCCGAGTGCTTT 

GACTTACGCCTGTAATCCTATCACTTTGGGAGGCCAAAGCAGGAGGGTCA 

CTTGAGCTCAGGAATTTGAGACCACCTCTGGCAACATAGTGAGATCCTGT 

CTCTACAAAAAATTTTT'lTrAAATTATCCAGCATGGTGGTACACGCCTGT 

ATTCCCAGCTAC7CAGGAGACTGAGG7G7GAGGATTGCTAGAACCTGGGA 

GA7CAAG7CAACAG7GAGCCG7GG77G7GCCAC7GCCC7CCAACC7CAG7 

GACAGAGGAAGACCC7G7C7CAAAAAAAAAAAAAAAAG7AG7AAG777AA 

AGAAC77AG7G7AGGCC7GGCA7A7AAA7GA7A77G77GA7G77GA7G77 

AGC77GAAGGCACA777A7AGGAG7AGGGA7777A7AACA77A7GAGCC7 

GAGAGCACA7A7AA7G77CCC 

>Contigll 

GGTC7AACA7GC7CCAAC7GAAGAAACCCCACAC77G7CCGGCAAGGAAA 

CTAC7ACAGA777CC7GACC7AC7G7GCAA77CGGGGCA7GCGACGGGAC 

7GTG777C7GGG7ACGC7G7C7CAGG77CG7C7GGGA7G7AAGAA77CAA 

C77CAG7AG77C7C7CA7AGACGCCGACGAGAGGGGCG7C7C7777C7C7 

GA7GAA7C7GCCAGA7C77CCAC77CA7AGAG7C7AAA7CC7CCGA77CG 

A7C7AC7GGAGACCCCCACG77ACAAAAACG7C7AACG7CGG7GACAGC7 

CCCCACA7AGGGAAAGA7CACC7GAG7C7CAC7ACC7CACA77AG7GC7A 

TC7CCAGCCCCA7GC7A7C7ACGAGA7GG7CACGCGAGG777AAGGGG7C 

7CCGA77CCGG7GG7CCGA77CAGC7AA7CG7GGCCC7ACG7GAACGA7C 

AC7CG7GC7CG7AACA7CGA7ACAGGG7CGCGC7GACAAA7GG7AC7ACG 

7AGG77C7CAGG7CAA7GCCGCG7CACGAA7GAGCC7AAC7ACCCCA7AA 

G7GCACG7AC7G7G77ACC777CC7G77CGGCCAAACC7GC7AC7G7A7G 

C7G7GC77G777 

>Concigl2 

AGGC7CCA7G7GC 7C7A GCCI^T7A7C7777CAAG7G7TrrA777GC7A 
A7C7A7 AAGGCCCCT7TCG7AAAA7G77CACrCA7rr7C7AA77AGA7A7 
777TT77AA7G77GAG777TGAGAG77C7TrAGA7A7TrTAGA7ACAAG7 
CCA^G7CAAA7A7G7GA7T7AOVAA7A7T77CrrC7CAATC7G7AArr7A 
G7TT7CA7CC7C77AACAGGGTCT777GGAGAGCAAA7AA777GA7777C 
A7AAGG77CAAA77A7TAA777777C77G7A7AG77CACAC77C7AG7G7 
7AAG7 C7AAAAAC7G7GCCT7G7CA7AGG7ACCAAAGGT777C7CCAG77 
r^lTlTCTAGAA GTTT AGAGTTTCATGTTTTACATTGGAGTCCATGATCC 
A77GT7AA77AAT7TrrG7A7A7AGG7AGA7G777AGGT77AGGG7TrT7 
77AAAAAAAAA77ACA7A7G777AA77GC7CCAG77CCC777CA77GAAA 
AGGG7A7CC77CC7CCA77GAA77GCCTr7G7CAGAAA77AA77GGACA7 
A7T7G7G7GAG7C7A777C7GGGC7C777A7CA7G77AC7777AAAAAA7 
GCA7CAG77CC7CCACCAA7ACC7CA77G7C77GA77A77GCAG77A7A7 
AG7AAGCC77AGCA77AGGAAAAG7G77777CC7GC777A77C7TrN7CA 
AAAAA77777GGA7A7TC7AGGGCC777ACA7A7AAA7777AAAA7AAC7 
77G7C7A7G7C7AACCGAAAGCC77A7GAAGA7777GA7AAGAA77GCA7 
7A7GCC7A7ACA77AA777AAAAAGAAC7GA7G7C777A77CAGT7GA77 
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CTGCTAATCTATGAACAlrtGCATCTCTw . CAAAGCATTTAGTCTTtdTt* 
AATTTCTG7CATTAATTTTTTAAAATTTTCATCCTAAAGATTC7GTATAT 
G7777G77GAA7 77A7G C77AAGCA777CAC777C7TGG7AACAA77A7A 
AATGATTTTGTGTTTTTTATTCCACTAGTTCATTTTCAGTGTGTAGAAAA 
GCAA7GAA77777G7G7G77GA7C777G77CC7ACA7C77GCAACA7TA7 
TGAACTCAT7TATTAGTTCTAGGAGGTTTTTTC A TTTTTCTTGTAGATAC 
CTTGAGATTTTCTATATAGACAGTCATGTTGTCTGCAAACAGGCACAGT7 



TGCAGTGGCTAGAACTTCTAGCACTATGTCAAATAGCATTGGTGAAAGCA 
GACA7CC7TG77CC77G7C77AGAGGAACA777GG7C777AA7C77GGA7 
TGCG 

>Concigl3 

GCGCC7CC7777C7C77CCAAAA777C7C77G7C7AG77A777G7CCAGG 

GAAATTTGAAAGCTCACTTACTGTGCAAGTCAGCAGGAAACAACTGGGTC 

TGTGCACAGCACCTAGCAAAGTTCTGCTCTAGGAATTACACTTTGGCCCT 

GAGGTAGATTTCTACAAGAACCTTACCTTCTAAGCAGCACTGGGGTTCAT 

CTTTTTCCCAGTCCTCAGAGCCCATTTTCACTCCTGAGTTCTCCCCCACA 

AAGGACATTTTCAACGTTGAGTTTATTACTCAACAGAAAATGGAATGAAG 

TCCAAGACCTAAGGAGATAGAAAGGGGACCAGTTATGGCATCTTCTCACC 

CCAGGACACCTTGCTGCATGTCTCTAGTGCTGAACAGACCACTGGCCTTG 

CTCTGTAGTTTGAAATGCTCGCTGCAACCAGAAAGGCACCAAGGGGCCAG 

ACCA7GC7C7CC7G7C7A7CACGCC77CAAAGCAGAA777CCCAAACC77 

GAG7CACAG7GC7AACACACGGGG7GCCA7AACA77777G77GA7777GG 

CATTTTACAAAAATAAAATAAAAAAGTTAAAAATGCATTGCTCTATTCTT 

GGGGCTGGCACACTATTGCCTTTGGCCAAATCCGGTCCCTGACTGTTTTT 

TTAAATAAAGTTTTATTGAAACACAACCATGCTCTTGTGTACATATTGTC 

TCTTGGCTGCTTCGAAGCTACAATA 

>Concigl4 

GTGTTCGCTTTTTAACACTTACCTAAAATTACTCTGTAATCCATGGATCC 
TTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCCCA 
TCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCCAA 
CTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAACT 
TTCTAAAATAGCACTATTCTATAGCATGAGTGTTCATTTACATrTTTGCA 
TATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACTCC 
CCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGCAC 
TATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAG7TTAAACTA 
AGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCCAC 
ATT7CAAGTGCTCAGCAGCCACGTGCGACTAATGACTACTGTACATCAAA 
CATATAGAACATTTCCATCATGGCAAAGAGCTCTATTGATAGTGTTCATC 
vAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTCAT 
GGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTATTT 
CTGCNACCATTTACCGGGAGAAGGCCTGGAAATCA7CACCAGGCCAACTC 
AAAATTATTACGTTTTCCAGAGCTTATATACCTTCTAAGCTA TATGT CTA 
CGTGTAAGTGTGCATTCACCTGAAGACGTTAGTGATTAACTTCTTTTAAT 
CTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTAAA 
TTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCTTG 
TCCCCTGCTAAATCATGGAGGTTTGGGGATTCCTTTAGAGCACCAATAAA 
CTTGTTTGTGGAGGCCTGGGGGTTTCTTCTGACCCACAATAAAACTTGTT 
TAATCCTAAATGGGTCCTGTTAAGAATTCCTTCTTTATTTTGTCATATTT 
TAAGGCCCAGAAAAGGCCTGGGCAAAACTCTTGATGGGCTTTTGTTACAT 
TCCAGCCTTTGTATAAGAACACTGGTTTTTAATATTTAACTTAACCATTT 
AGTCAGTACTGAA ACAG TTGTTATAGAGATCTGCATTAGTGA GACCT GGC 
CTGCCACATTTCCTTTTCTGAAGATCTTATGGTAGTGATCACCTTTGTGA 
. AAGGAAAATAAATCTTGGGACCTCAAAATCACTAAGCCAAAGAAAAAAGT 
CAAGCTGGGAAGAATC TGAC ACTTAAA TCCAA CACT GCTAA CTCATTCAT 
CTCACTCATTCATTCATTTTATTTTCTTTTTTCTTTCTTTTIT'lTTTTTT 
TTTTTTGAAACGAAGTCTTGCTCTGTCACCCAAGC7GGAGTGCAGTGGAT 
CTCAGGTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTACCT 
CAGACTCCTGAGTAGC7GGAATTACAGGCACCTGCCACCACGCCTGGCTA 
A77777A7A777TTAG7AGAGACGGGG777CACCA7G77CA7CAGGC7GG 
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TCTCGAACTCCTGACCTCGTGATCCGC . .CCCCCTCGGCCTT3TTTGCT * 



CACACAAGAANATTGTACAAAACCCCTAGGA 

GGAAAAACCTATCACCGCCTCCTATGGAACTTAAAACAAAAAGAAAAGTA 
» ™ a « r^aaaTG&ATATTTCATTCTGGAAGAACATTGAAAAAGAACAGGA 


AGAAGAGAAAGCACAACTCGAACTGTCCACTABAAl lwu *™-™lZZZZZ 
CAGAATG7CTGAACCTCATCGAAGGGGTAAGTGAAAAAAATAAGCTCCTC 

CAGCTTTGGCCCAAAGTCTrATAATTTTTAAACATATTCCTAAATATAAT 

ATAGGAGAGATAGCCTTCATCTAAGTAGAAATTTAGCTACTCTTGTAAAT 

.i7<- ,^»Timm arnar aTfieeCATAAACAGTGTCTTTTGTGTAT 



^CACTGTTACTATGACO^CTTTACAAACAAAACTGAGGCACAAAGAMTT 

GGAAAACTAATCCAAACAAACTGGCTCCAAAAGGAACTTTGCTTTCTTTG 

GGTATCAAGTTCTGAAGAC?rACACATTTAACATTGAAACTGAGGTCAGAA 

GG^GTTTCTATGTAAAGTTGGAGTATTCTGAATACTCTGGGTAGCTAC 

AAATAGTATTrAAATTTTATCTTGGATTCTGCAGATAAGGATAAAATAGA 

XGGTAGGC^^GAGTATGATCCTTAGGAGA^.TTrTTCCTGAAGGAAAAA 

TATATTAATAAAAAATGATGGAATAAACTTCTAAGATCCTTGCCTAGAGC 

AAAAC^CATTCAGTCCTTTGGCTGGTAATGTTGAACATCAACAAAAAAAA 

GGAAAAGTTCAGTTTAAGTCTACTCCAG^CAmCAC^CA^ 

t AAATATTAACTATTTCTCTTTGTGGAATTGAACTAGAG i ^ -TT ITTC. 

TaTCCTCTTTTTTGGTTGTTGTATTATTTAAAAATGAGTACCTTTTTATT 

ATTGAAATCATTTCAAGTAATGCAGATAAATGATCAGCCCTCTCCCTGTA 

CAAACATACATACTTAGGCATCCCAAACTTCTCTCTGGAGGTGACCACCA 

TTGCCAGTCATTCATTCTGTTTTCATGCATGTCCATACAGTATAGGTATG 

TCGAGAAATGAAGTATTATATTTTTGTGAGTTGCAATTCTTTTACTCACA 

TTTTTGTGTACTTTGGTTGTCTTTTCTTGTGTTTTCCTAGTACC^TGTT 

ATGCTGACTTAGGCAGATGAGTTGAGTATTTrCCTTTTTGCCCTATAAAC 

TGAAAATAGTTTGTAT<3ACATC*G^^ 

ATAAAAACTTGCCCATAAAAATCGTCTGGACCGGTTTCTTGAGGATGCC 4 
GTGTTAGAGCC 

CGCTCTAACCTGGGCTACCAATGGTTCGTCAAGTTCTAGATTCTCTAWA 
. ---~T»w^owft^ftTrn«wr^rTr«TCTC3TTrTCAGCCCCGAGTCTCT 


TAGATCTGTCCTCTAATATTCCTATTGACTTTA(--i i » 
^ATCCTTrTGCTTrACTrTCCGAGAGACCTGCTTAACC^ATCTCCCAA 

CTCTTTTATTGAATTTCATTTCTTTTACTATATATTTTTTACTTTGAAT 

^CCTCTCTCTTCCTCACATTTTCCCCCATAGTATrrTGTCTTC^TTGA 

CAGTTCTACTATCTTATTACTCTGGAGATATTAATAATAGTTTWAAATT 

TTTATTTATTTTTA'rrTTCAAAACAGTGTCTrACTCTGTCACT^CGCTC 

GAGTGCAGTGGTGTGATCATGGATCACTGCAGCCTTGATCTCTGAGCTCA 

AGCTATCCTCCTGCTTCAGCCTCCCAAGTAGCTGGAACCACAGGCATGTG 

TCACCATACCCAGCTAATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGT 

AGCCCGGTCTGGAGTGCAGTGCTGCAATCTGGGCTCACAGCAACCTCTGC 

_„™ r . r ,^ nr ^ r ^^r^^fV'aaTTCTC:CTGCCTCAGCCTCCTGAGTAGC 



TGGGATTACAGAAALAUAI.iai-«-ki<*-ww»«w ww»»»_>. "Jl^IiiZ, 
AGAGACAGGGTTTCACCATGTTGGC CAGGCTGGTCTTGAACTCCTGACCT 

TCTGATCTGCCCACCTTOrcCTCCCAAAGTGCTGG^ 

CCACTGCACCCGGCCACTAATTTTTAAATTGTTAATAAAGACGAGGTOT 

GCTATGTrGCCCAGTATGGTCTTGAACTCCTGGGCTTAAGTAATCCTCCT 

GCCTCAGCCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACTGAATCTC 

& r & TTTTTT AAAAflTTTTCTTCTCTTTACCAAGTCTTTTTTCC CCTTT Cr 

GCTrTTTTGGGTTGTTTTATTTTGATCTCTATCTTGCTAGAAACCT 
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ATTCAACATCATGCTGAA* CCTTTCAA * uATCATCTTGTTTTTAGTJ&TL 
TCCTACCTTAACTCTCTGTCTTCTGCTAGTATGGGAAAGATGACCTGAAA 
ATCTAACCATTTATTTTTCCCCCATTAATATCATTTTATGATTATTCAGA 
AGTTAAATAA TTGT CATGCTGTCCTCCAAAAAGACTGAATCAACTAGCAA 
CAAATAAGAATTTTCTCACAGCTCTGCCAGCATTTTAAAAGAATAGCTTT 
ATTGAGCCCAGGAGGTCAAGGCTGCAGTGAGCTGTGATTACACCACTCTA 
CCCCAGCCTGGGTGACAGAGCAAAACCCTGTCTCAAAAAAGAAATTTAAG 
GAACAGCTTTATTGTTGTAAAATAGACATACAATAAACAGAGCACATATT 
TAAATTG7GCAACTTATACTTTGATATAACCCTGTGAAAACATCACCACA 
ATCAAGATAGTGAATATATTTATCACCTCCTGATACAGTTTAGCTCTGTG 
TCCCCACCTAAGTCTCATGTTGAATTGTAATCCCCAATGCTGGGGGAGGG 
GCTTTGTGGGAGGTGATTGAATTGTGGGGGTGCACTTCCCCCTTGCTGTT 
CTTGAGATAGTGAATGAGCTCTCATGAGCTCCCCTTCACTCACTCTCTTT 
CCTGCTGCCATGTGAGGATGTGCTTGCCTCTTCTTTGCCCTTCTGCCATG 
ATGTGTTTCCTGAGTCCTCCCTAACCATGCCTCCTGTACAGCTTGCAGAA 
CTGTGAGTCAGTTAAATCTCTTTTCTTCATAAATTACCCAGXCTCAGGTG 
GCTCTTTATAGCAGTGTGAAAAGGAACTAATATACCTCCTAAGTTACCTC 
AAGCTTGTTTTTAATTCCTTCTCCTCCCTTCCTTCATTGCCAAGCAAACA 
ACCACCTGTTTTCTGTCACTATAGATTAGTTTACATTTTGTGGGTTTTTT 
TTTTTTTTGAGACAAGGTCTGACTCTGTTGCACAGGAGCAGAGCAGCGTA 
TC 

>Concial7 

CGCGTTATAGGAGATGCGAACTTAAGAAATGATGATAAGGAGACTTTA7T 
AAATATAATTTTGAATTATTTTGCCATTACAGAAATTCTAATTATTTAAA 
ATTCTATTCATAATTTTTAATCACTGTACTTCCCAAGCTTAGCTTAGAAT 
CCTTCTGTGCTGAGGATTAATTTTAATTTGTCTTTTATAGGCC7TATCTA 
AAATCCAAGAATAATTGCCAGAATCAACCACCTTCTAAATCTGTAAGTAG 
AAATTAGTCTTTTTAAAAATATGCATTCATAAGTATGATTAGTAATAAAA 
ATAATAAAGATGTTAGCAACCTAAAGAACATGTATTTGAAAGGTATTTCT 
TACAGATATAAAAACAGTTTGGTTTAATAAGAGACAATCATTTTTTGAAA 
AGTATGACATTTTTTGAAAAGTAGTTTAGTTTTATTAACCAAGAAAAGCC 
TCAAGTGAACTTTAGTCCTCTTGATAGCTAACATTTATTGAATGCTTACT 
GTGTGCCTGATACTTTTCTGACTTGCATTACCTCACTGAGTCCTCACAAT 
CTTATGAGGCTACTATTAGTAGCCCCACTTTACAGATGAGCAAACTAAGT 
CACAGAAAGGTTAAATAGGTCGTATAGCTATTAAGTGACAAAGCTGAGAG 
CCTGTGATCTTAACCACTTTGGTATGCTGCCATGAAGTTAAATAGCTCAG 
TAGTCATTAAAAGAGAACATTTGCATTGAACCTTCCAAGCCACTTAACAA 
GTATATGCTTCCTAATCAATTTAATTTAGCTACATTAGATAGAATGGTAA 
AGGATCCTTAACTTAAAGTTTAAATGGAAGAAATTAGCCCTCTGAAAGAG 
GCACAGATTATTCATCTGCAATAAAAATCTCACCTTTAGTTTTTTAAAAC 

TGAAAAATTCTCACTGGTGAGAATTTTAATAAGTTTTAATGATTCACCAA 
ATCACTTCAGTCATATTTCAGTCATATGCATATGCATATATAGACATATA 
AGTTTTTATCTGTGTTCTGAAATGTAACTAAAATAGTGCTTCCTGAAGTG 
AAAAATTCrrCACTGGTGAGAATTTTAATAAGXTTTAATGATTCACCAAAT 
CACTTCAGTCATATTTCAGTCATATGCATATGCATATGTAGACATATATA 
TGTTGTATGTATACATGACATCATTAGACACTGTGAAGGATAGCAAAATG 
TATATAAGGCAAAATTTATGAACAATGGTTTAACGTTTGGGAAGCACTGG 
GTTACACTTTTACTTTATGCAGATTGAACCAGTATAGTATGCAAGTCTTA 
AGGAAAAATCTACTGGAAAGGGCCCTCATTCAGACTTCCCAGAGGCTTCT 
CTGGAAGTTGACAATACTGACTTCAGTACATCAGCTCGTAAATGAGGATG 
ATACCTACCTTATCTGCTTTACACAGTTGTAAAAGTAAAAAGTGAACTCA 
GGAAGGGAATTACAGAATTTAGGAGAAACTAAAAGCACGATGTAAATAAT 
AGTCATCATTACAGTTATATAATGCTTGACAATTTATATAACACTTTCGA 
TACATGACAACAATAACTAACACCCAGACATGTTTATATACATTACCTCA 
CTCAGAACAACCATGTGAGGAAGTTGGCCATATGCTTTAATGTCCAAACC 
AGGACACTTTTGAGAGTAAAAAGCAGTACTCTTTGACCAACAGGCATAAA 
TCAAAACTATCTTGTGAAAACCGGGATATATGGCATCCTTCCTAGATAAT 
AGATACTTTTACTATTATTAATTTTGCTGTGAATCTAAACCTGCTCTAAA 
AAAGTTAATTTTAAAAAGTAATGAAGTACTGATACATGCTACAACATGGG 
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rAAATCTTGAAAACGTT,* jTGCTAAGTG. , AGAAGCCAGACAGAAAAGtec^ 

ACATATTACATGATTCaTTTATATGACACATCTAAAATAGGCACATCTA 

TAGACATACAGAGACAGAAAGTAGACTAGCGGTTGCCAAGAACTGCAGGG 

AGCAGAAGATGGGGAGTGACTGCCAATANGAAAACGCATTACGT 

>Concigl8 

TGAATCGCAATGATATGTGCCACTTTGCACTCTCTGTGACATATATAArr 

ATTTTTAATGCATTCA1 1 rriTi'CTCAGAGTGCATTCGTTTGAAAACATA 

GACGGGAAATACTGGTAGTCTTCCTTGTCAGTTAGAAACACCCAAACAAT 

GAAAAATGAAAAAGTTGCACAAATAGTCTCTAAAAACAATGAAACTATTG 

CCTGAGGAATTGAAGTTTAAAAAGAAGCACATAAGCAACAACAAGGATAA 

TCCTAGAAAACCAGTTCTGCTGACTGGGTGATTTCACTTCTCTTTGCTTC 

CTCATCTGGATTGGCATATTCCTAATATCCCCTCCAGAACTATTTTCCCT 

GTTTGTACTAAACTGTGTATATCATCTGTGTTTGTACATAGACATTAATC 

TGCACTTGTGATCATGGTTTTAGAAATCATCAAGCCTAGGTCAGCACCTT 

TTAGCTTCCTGAGCAATGTGAAATACAACTTTATGAGGATCATCAAATAC 

GAATTCATCCTGAATGACGCCCTCAATCAAAGTATAATTCG^CjCCAATGA 

TCAGTACCTCACGGCTGCTGCATTACATAATCTGGATGAAGCAGGTACAT 

TAAAATGGCACCAGA CATT TCTGTCATCCTCCCCTCCTTTCATTTACTTA 

TTTATTTATTTCAATCrTTCTGCTTGCAAAAAACATACCTCTTCAGAGTT 

CTGGGTTGCACAATTCTTCCAGAATAGCTTGAAACACAGCACCCCCATAA 

AAATCCCAAGCCAGGGCAGAAGGTTCAACTAAATCTGGAAGTTCCACAAG 

AGAGAAGTTTC CTAT CTTTGAGAGTAAAGGGTTGTGCACAAAGCTAGCTG 

ATGTACTACCTCTTTGGTTCTTTCAGACATTCTTACCCTCAATTTTAAAA 

CTGAGGAAACTGTCAGACATATTAAATGATTTACTCAGATT7ACCCAGAA 

GCCAAXGAAGAACAATCACTCTCCTTTAAAAAGTCTGTTGATCAAACTCA 

CAAGTAACACCAAACCAGGAAGATCTTTATTATCTCTGATAACATATTTG 

TGAGGCAAAACCTCCAATAAGCTACAAATATGGCTTAAAGGATGAAGTTT 

AGTGTCCAAAAACTTTTATCACACACATCCAATTTTCATGGCGGACATGT 

TTTAGTTTCAACAGTA TACATA TTTTCAAAGGTCCAGAGAGGCAATTTTG 

CAATAAACAAGCAAGACTTTTTCTGATTGGATGCACTTCAGCTAACATGC 

TTTCAACTCTACA TTTA CAAATTATTTTGTGTTCTATTTTTCTACTTAAT 

ATTATTTCTGCAATTTTCCCAATATTGACATCGTGTATGTATTTGCCATT 

TTTAATATCACTAGACAATTCAATCAGGTTGCTACGTTGGTCCCTTGGGT 

TTACTCTAAATAGCTTGATTGCAAATATCTTTGTATATATTATTGTrTTT 

TCTCCTATCTTGTAATTTCTTTGAGCACATCCCAAAGAGGAATGCCTAGA 

TCAATGGGCACAAATAATTTGACAGCTCTTATTAAACATTATTCTGTAAG 

TAAAAACrGAACTACTTTTCAGTATCACTAGCAACATATGAGTGTATCAG 

CTTCCTAAACCCCTCCATGTTAGGTCATTATGAACTTATGATCTAACAAA 

rTACAGGGTCTTATCCCACTAATGAAATTATAAGAGATTCAACACTTATT 

CAGCCCCGAAGGATTCATTCAACGTAGAAAATTCTAAGAACATTAACCAA 

GTATTTACCTGCCTAGTGAGTGTGGAAGACATTGTGAAGGACACAAAGAT 

GTATAGAATTCCATTCCTGACTTCCAGGTATTTACACCATAGGTGGGGAC 

CTAACTACACACACACACACACACACACACACACACACACACACACACAC 

CATGCACACACA^CTAGATOU^CACTTGATTTTATACAAATACAATGAA 

TTTACTTTCTTTTTGGTTCTTCTCTTOICCAGTGAAATTTGACATGGGTG 

CTTATAAGTCATCAAAGGATGATGCTAAAAiTACCGTGATTCTAAGAATC 

TCAAAAACTCAATTGTTTGTGACTGCGCAAGAAGAAAACCACCCATGCTG 

CTGAAAGTCAGTTGTCCTTTGTCTCCAACTTTACTTCCTTTACCTCTCAT 

ATGTTTGTGAATAAGCCCAATAAGCAGACNCCTCCTACAAAGTGAACCTG 

GTCTCTTTCCTCCTAACAGGG 

>Contigl9 

GTCTTGTAACACAG GTAA GACGAGTTCAAGTTTTATTTCTTGNTTTTAGA 

ACGGTAGTG AGCGGTTTTCAGCNTGAGAC CACACCTAAGGTAAGTAGCTG 

AATTGGGGTTTTGTCTTGGCTAAAGTTTAACAACCAGCTGGTCTTAATTT 

CTCCTTACCATTAGAGCACTCAGTAATCATATAAGTTGTGTGATCATTCA 

TTTTGCTTAACrGTTTGTTTCTGTTTTTATTGCTGTTTC^ 

CATTGGGTTTGACCTACTCTATCTGACTTGATCAAATCCAAAGGAAATTT 

CCAAATTATGGGGAATGAGGCCTCTGAAGTGGCTAAATTCCCACCCTCCC 

ACACACACAAACGTGGTATGGTGGGGGAAAAAACGGCCAGCAAAAGAAAA 

AAAAAAAGGAAAAGATGTTTCATTTTGACCACCAAACGGGCrTTATTTAC 
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ATAACAAGGCCACCTTT ~ *GCTAGCCA ^CCATACTGAAAGAGCSKTGL^ 

TGTTGCCCCATGCTGTGGGTTCCATAGCTAACGTTCTGCCri'll'TTCCTA 

CCACGACAGCCTGGGTTTGGTTCCTAAATCAAGC CT TTTCTGGTTTGATA 

CTTGGTAATGCTGAAATAGCAGCAATTTGTCCTAGCTGAAATATCGTAAT 

AAGATTTTAAAAGATTTATTTTAAAGGACCTCAATAGTTAAAAGTCAGCT 

TAATTAAAAGCTAACATCCAAGATGTGTGCATGTGTATGTATGCGTCTTT 

GTATTTAAATAGCCCTCATGTTTTTTTTTTCTTTCCTAGGAAC TTGC CTT 

TT^TGAGCAAAAGTTTTTTTCTTCTCTGTTGACTGGATTCTGTTTTCTT 

CATTTACTTCTGCTGTCTCTCCTTTCTCTTGCACCGTCTGCTGCATGAGA 

GCCCTAAAATAGTTTATAATAGCCTGGGGTTCCTTAAAGAAAATGGAGAA 

GG7GCCAGGCTCCCTTTTAGGGAGAAACTTCTATTTTTCCTTATGGAATC 

CCTAGAGTGTAAACAGACAAGTTCATTTCAGCTCTTAAACTGCTTGCGTT 

TGTGTTGTGTTACCTGATTTTTTTGACTATTATATTTTTGACTAGCTATT 

GCAACAGAAGCTACTCTTGGGTTTTCAAGGAAGATTGTAGTTTAGACATG 

TAGAAATGTCTTTTAAAAAAAAAACAAACTTTTTTTTAAGTGCACTGTAA 

AAGCATCATATGGTCTAGCCTCCTAATAATTTTCCCTTTTTGGAGACCAG 

GATTCAGGGTGGGCTCTGCCCAGAGCTCAGAGATCCAGTTAAAAGAGAGG 

TAGTCTCGGCCGGGCGTAGAGGCCCAGCCTGTAATCCCAGCACTTTGGGA 

GGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCCA 

ACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTG 

GTGGCAGGTGCCTGTAGTCCCAGCCACTCGGGAGACTGAGGAAAGAGGAG 

AATCGTTTGAACCCGGGAGGCGGAGCTTGCAGTGAGACGAGATGGCGCCA 

CTGCACTCCAGCCTGGCGACAGTGAGACTCCGTCTCAAAAAAAAAAAGAT 

AGGTAGACTCGATGTTGTCGTACCCGAGCAAGTTAGAGCAACGCCACACT 

TTGAGACGAATTTAAGAGTCCTTTATCAGCCGGCGACCAAGAG ACGG CTA 

ACGCTCGAAATTCTCTCGGCCCCTTGGAAGGGGCTTGATTTTCCTTTATG 

CTTTGGTTTAGGAAGGGGAGGGGAGCTCAGTTGCAACAATTCTACAGGAG 

TAAAAACATGCAAAGAAATTAAAAAGACAAGTGGTTACAGGGAAACAAAC 

AGTTCCAGGTGCAGGGGCTCTAAATCTATCATAAGATGTTAGGTATGGGG 

GCTCTGCCGGACACAAACTCAAGGCTTTATGCTGTTATCTCTTGAGCGAA 

ATCCTGGGAACTTCGTACATTGCTTGCTTCAGTACCTTATCAGTTAATCG 

GACTCTTTGATATGTTGGGAGTCAGCGTACACAAGTTAACTCCTTGAGGA 

AGGGGGTGGGTAAGGAGTCCTTGATGTCTGGTAAATGAAGGAGCGAAATC 

GAGTTCCTCTGGCTTTCTCAGCTAAGGGAGAGCTTATTCATGTGGAAACA 

AGGCTAAGTGATTAAGGGAGAAAGGGAGAGTCTGAAAACAAGGTTAGGTA 

TTAGAATGTCAATAAAATTGGTCTCCTTATACAGTCCTATGGTAGATTTC 

TTTC CATCTTTAA TCTC CCTCTAGCACCACCAGACTTTTTCTCTCTGTAC 

CTTGAGATGTAAATTTTGCTATCTGAATTTTCGTCTAAGAGTTGTTTCCT 

TTAATATGCAAATTTAGGGTTATTTAGCTGACAACTGCCAAAGTAGTGAA 

ACAAGTTATCAAGAACTTGAACGTCTAAGGTAGGAAAAAAAAAAGTCTTT 

ATGAATCTATAAGATGTACTTCTATTGGCATGCCTAATACGTCTATGTAT 

TTACG7GTTGTGTACACAGTTTTTCACTACTGAAAATATATAGAGGAGTT 

CTAATTAATTGACTTAAGACAATAAAAGCGCTTGAATCAAATACCTTATC 

AGGAAAAAGGAAAAGACAAGTCAAATGCTTGTTCAAGTCTATATAACTTA 

AGTAAAATCTTTAATAAATAAGCTAGCTTTAACATTATTTGAAATGTCTT 

AAGAATTGCCAGCAGGTTCTGGGTTACAGAACTAGTGGGGGTGCAGTGGG 

GTGAGGGTTGGTGGGGTGGGNGGTNNNAQ3NNNNCNCCCCCCCCCCCCCC 

CCCCCCCCCCCCCTCCCCCCCCGCCCCGNGCGGGCCGCGCCCCCCCCCGC 

CCCCCCGGCCCGCCCCCCGCGGCCCCCCACCCCCCCCCCCCCCCCCCCGC 

GCCCCGCCCCCCCCCCCGCGCCCCCCACCCCCCCGCCCCCCCGCCCCCCC 

CCCCCCCCCCCACCCCCCACACCCGGCCCACACGCACCCCCCACCCCGAC 

GCCCCCGCCCCCCCCCCCCCGCAGCCGACGCCCCCCCCCCGCCCGCCCCG 

CCCCGCACCCCCGACCCCCCCCGCCGCCCCGCCCCCGCCCCCCCCCCCCG 

GCCCCCCCCCCGCCGGCGCGGCGCCCCACCCCCCCCCCCCAGCCCCGACC 

GCGCGCCCCCCCCACCCCCCCCCCAGCCCCCGCCCCCCGCCCCGACCC 

>Concig20 

GGCAGTACGCTATAATTCCCTCTTCACCTTACCTCATCTGTTCTCTGATG 
GATGTA Ci i ' l ' I T T 'l 1 1 T AGTTTCTAAATTCCCTTTTCCTTTGCTCTGGAG 
ATGGGTGATTGATGTAGTCTGGGTATTTGTTCCCTCCAAATCTCATGTTG 
AAATGTAATCCCCAGTGTTGGAGGTAGGGCCTGGTGGGAGGTGTTTGGAT 
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CATGGGGGCAGATCCC . JVTGAATAGC JGGTACTGTCCTCTCAATAG . J 

AATGAGTTCTCCTGAGATATGGTTGTTTAAAAGTGTGTGGCACTCCCCCA 

TTGCTCTCTTGTTACTGCTTTCGACATGTGACATCCCTGCTCCCCTTCGC 

TCTCTGCCATGATTGAAAGTTTCCTAAGGCTTCGCCAAAAGCTGAGCAGA 

TGTGGGTGCCATGCTTGTACAGCCTGCAGAACTGTGAGCCAAAATAAACT 

TCATTTCCATATAAATTACCCAGCCTCAGATATTTCTTTATAGCAACATA 

AGAGTGGCTTAATACAGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAG 

CACTG7GGGAGGCTGAGGGGGGTGGAACATGAGGTCAGGAGATTGAGACC 

ACCGGCTAACACGGTGAAACTCCATCTCTACTAAAAATACAAAAAATTAG 

TCGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCTGGAGGCTGAGG 

CAGGAGAATGGCATGAACCCGGGAAGCGGAGCTTGCAGTGAGCCGAGATT 

GCACCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCATTTAAAAA 

GAAAAAACAAAATTTCAAACAGAACAAAATGAAAAAAATACCAAGTGAAA 

GGCCCCTATAAAAACCCCTCTGGGGCCCATCCTCCCACCCCCTCAAGTGA 

AACCACATTTAACAATTTGGTGCATATCTTTCCAAACCTTTTGTTGTACA 

CATATAAAAAACATACATGCTTTGATTTGGCTCAGACTGTACATAGTGTT 

TTCCCTCTTGCATTTTACACTTAATATATCTTTGACATCTtTCTATGTCA 

GTGCATGTTGGCTCGATGATATTCTATCATTAAATACCCTTCCAAAAATG 

GTAAAATCATTTTAAAAAATCATTCACACAAGTACATATTTACAATTT7A 

AAAGAAAACAGAATCCCAAAACACAACGACAAACCTCTAAAAATAATCTC 

TATCTTTCCACCAGCATGGAACAGTTCATTCCTTTTTCACATAAAACGAA 

TTATGTGATTGGAAAGATTAACTCTAATCTACACATTTATATACAGAATG 

TTCTATTTGTTAAGCCTATCTGAAAATAAAAAATTCAGATGATTAATTCA 

CrTACACTTAGAAATTAAGTCAATATACTATGAATACACATTGTGATCAG 

TTATAATATGATGCTTCTTAGTCTAGGGTTTCAATTAAATAACAGTAAAA 

AAAATTGGATAAATAAGACAGCTAATAACTGAAAAATCCAGAAATTCAAA 

GATTATATTGCCAACTAAAACACTGCCATTTACATTTtTTTT l 'CCTACTT 

GGTAGCAAATGCTAATGGAATTCAATCCTGATTACTTAAAGTCAGTTCAC 

ATCACACATTCAATCAGGATAATACGAACATAATATGCCTACTATAGCGT 

TAGATTAAGACATAAAATTTTTTTGCTTGAAAGTAATGACTGCGTACCAC 

TTGAGACATTTGTCAACCACTTCAGCACATTGTTTACGAGTGACTGGATG 

TCCACAAGGAATAAAAACGACAGCAATATTTCTATCCATACAGATTTTGC 

AAAGCTTCTCCTCTTGCAGGTGTCTTAGCTGCTCTTCAGTACTAATCTCT 

TTCTGCAATGAAGTCTGACTTGATTCGTCTTGTGTACTGTCTTTCTGAGC 

CTTCACTGGATCTGCAATCAGAACCTCAAGTGATTTACAGTTGCTCCCAG 

ATGTCTGAATTTTTTCCTCCATTATTTTCTTAATGTCTTTGAAACTGAAC 

CGCATTCATATAGCTTCTTGTACCATAGGATTATGGAAGATGGTATCAAT 

TTTTCTAGTTAGTGATGGCGTTTTTTCAGCAGTTCTTACCAGACACTCCr 

ZAAGTGAATGGGATAAATGAATATTGTTTATATATTTTCGTGTCTTCTGT 

TCTAACAGATATTTACACCCTGGATGCCATTAACATGTTGTCCCAAGGGT 

CTTNCTGGGCT 

>Concig21 

CTTTCTCCCTTTTTACCCCCATTTTCGTAGGGATTTGGTTAAAACCCATG 
TAAAAAATCCAAACACCGGCGGGGAACGGGGGTTCAAGCTCGTATCCCCA 
CCACTTTGGGAACCCAAGGTGGCAGGATTGTCGGAAGCCAGGCATTTGAG 
CCCACCCTTGGGAAAAAAAAGAGAACCCCCATTTTTTTTGAACAAAAACC 
CCAACCCTCCCAGGAAAGAAATAAGTATGGCTGGGTTGAAGTCACCAAAG 
ATGGCCGACTGGCTGGTCAAGTAACTTTACCTGATGGTTCGTAGAATATT 
TACCTTCACCCAGGTGGGAGAATTGCTTGAGCCAACCCTCAGTGTGGATT 
CAGGAACTTGATTTAATTGGTATCGTGATTGTGGATTAGATTCTCAGGGA 
TGCATTCACTAAGTAAAAGTGATAATAGCTACTTTTAAGTAAAATAATGA 
ATGAATCAAACACTCTAAATCCATGGTGCTATGCTAAGCTCTTTCTGTAT 
TTTATCTCATTTGATATTACAAATATTTGATGTGTTAATAGTAATGACTA 
TCTCCATTT7TACAAGTAAGGAAACTGACATTGAGAGATTAAAAGACTAG 
CACAAATCACAAAGTAAATGAGATTTGAATCCGGTCTTGATTCCAAACTC 
TACAGTATTCTAAATTCAAGGAGACTAAATTATAAGATGGAGAGCCAATT 
TTACTTTATAACAGGGTTAGAATGGCAGAAGAGACCTGACATTCACACCT 
CTAGCCAGTGCATCATCTTCCTGTAGGCAAATATGCAGGAAATCTATAAT 
AAGAACGTCCTTTGGTGAAGGCCAGGTGCAGGGGCTTACACTTGTAATTC 
CAGCACTTTGGGAGGTCAAGGTGGGAGGGTCGCTTGATGACAGGAGTTTG 
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AGAACAGCZTGGGCAACAXAGTGAGACuwTGTCTCTACAAACAAWAG^ 

ACACAAAACAACTTCAAGAAAACTCCTTTGGTATGGATCAGAACAAGATG 

AATTATCTATCTGATCCAAATGCTTAATGACATTAAGCCACAGTCCACTC 

ACTGCCACAATAGAGATATACCTGCCAATGCCACTCAGGTAATCCCATCA 

AAAGTGGTAATGAGGTCTGCAGCATGACTTGTTCTTAGTGATCCCAGCCT 

GAGACCTTGAGATTGGAGCATTTTATTCTACATATGCACAAAACATCTGT 

TGAAAAATC77CTAAATTGATGCAATACATTCGTATCAAGAATACCTGTC 

TGTAATCTCCATAAACCCTCTCCTTTCTGTTTTAAAAAATAGTAACAGCA 

TTTCTCCTTACATGACAAAGAAATGACTTCACCATCTACGAAATAGTGAA 

TAGGAGCTGTGTGGAAGGAAATTAGCTCTACTTCTTGGTGGAGATGAGAA 

GGGAGTGTTCCTCTGAAAATCAAGGCTCTTGTCATGCTAGGAGCCAAAGT 

CGTTTTTTAGAGTGTGGACAGTTGAGAAGATAAGACAGGGACCATCCACT 

CATGTTTTTCTTATTCCATAGGCCTCTCTCAATTGGGCAAAGCACTCCAG 

ACCTTTTGGAAGAGTGACACCAAAGGCAAGCACCTGCTTGGCAGGCCCCT 

CAGCTTCTACGCAAGTATAAGTGAGTATATAAAATGGGGGTACTTGTGCT 

GTTGAGTACCTTATTTCCAAATGAGGCCTGCCGGTGTCCCTGTGGCTGTG 

AGAAGGCCTCTACTGGATAGGTGGAAGTTGTGTGTTCTCATCTTTTCTAA 

CCCTGGATTGACTTGCCCAAAAGGAAGCCATTATTAACACTATAATAAAA 

CCATCCTTAATCTGGGACTCTCTTCATGCAGTGGTTCTTAACCAGTGATA 

AACATGAGAGTTACTTTTGGAGCTTAAAAAAATTAAGATGCTCAAGGTCT 

ACCCAAACTGACTGAATCTCCAGAGGTGAGGCCCAGGGATGTATACTTTT 

GAGCCAGACCrCAGTTTACCCTGCAGAGCTCATAAGGTTGCATAACACCC 

TTTGTCAGCCACTCTGATGAAAAGAAAAATTGGTGAGGAATAAGTTTTAG 

AGAAGAAGGAGCAAAGGTGTTCTTGGCCAGTGAGAGCCAATGACAGGGAA 

ATGCAAACAATGTATCCACAAGAAAGGTAAATTACCCTATAGAGCATTTT 

AGGATAAATGAACATCTCATGCCTAGGGTTGAGAGAGGGTACAAAAAAAA 

AAAAAAAAAAGACCACTCTGGATACACAACGCGATAAATGGAATAAAGAA 

1 ITiTT T C CTTGTAAATTAAAAAAATC CTTTGTT ACTG AGGTAT AATTTAA 

TCTATTTTATGTATAGTTCAATGAGGTGTTATAGATAATAAATTTTTTTT 

GTAAATTATTATATTGTCATATACTCATACATTCATTTTTAAAAGTCAGA 

AATGTATATAACCATTAAACTTATAAATCATTCAGTCATTCAGAGATATA 

GATACACGAGC^TATTTTATATCCACCACAATAATTATTACCATCTCAAC 

AATTCCATCACCCCTCAAATTTCAAGCGTAGGGGTTTTT A AATGTCAAAG 

GAGTCTACTCAGTGGGAAGAAAGTTAAGGAAAAAACCTTTGGGGCT7TGG 

GCTCCTTCCCCCTGGGGTTAAAAAGGCAGGAAATTGGGCTTACCCCCCCT 

GAAATTGGGAACTGAAATTTTGGGAAGTTTAAAAAAAAAAAAA 

>Concia22 

TCAAGCAGCCTTCCTTCCTTGGCTTCCCAAATTGTTGGGATTACAGGCAT 

GAGTCAGGATTCCTGGCTTAGTTTACATTTTCTAGAGTTTTGTATAAATG 

GAAACATACAGAATGTATTTTTTTGCGGAGTGGGGGAGTGTTTCTATTTC 

TTTCTTTCCATTTTCCCCCCCCCNCCCCCCCGAGACGGAGTCTCGCTCTG 

TCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCGCAAGC 

TCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCT 

GGGATTACAGGCGCCCGCCACCACACCTGGCTAATTTTTTTTGTATTTTT 

GGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGA 

CCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACAGGCGT 

GAGCCAC CGTGC C CGGC CCAAGTGrrTCTATTTCTT A ACCAGCTTTCATG 

CAATCTTTTTTTATTTTACCATCTCTGTGATCCCACTCCCAAAGGTACTA 

GATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCTTTCCA 

GCCTTCCAAAAA Trnil ' l^rmTrr CTTAAAGATACTCCTGTGTGAGG 

CTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGTGAATG 

CCAGGGAATTGCCTGATTGATCAAAGAAATGTATCCCCTTCTCCCTCACT 

CTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGAATTTA 

AATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAAGTACA 

GAATTAGGTGCAAGAGTGCATAGCTGTrATTTTTTTTTTGGCCTCTGAGA 

CTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACCTGAGA 

TGTCAGGGGGGTC7GATAGAGTACGTTTGAAGGCAGTTACTGGAAAAAAA 

TAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAATATAT 

TGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCT7TCTTCT 

TCGTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAACTTCT 
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TAGGAGGAGAGCTTGCCwCCCGTGGC .JVTGGAGAGGTCTTGCA^GA- : 

AAAAGAGATGCTCCCACTCAATGCAGGATGG7GTGGAGGTAAATGGGGAT 

ACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAAAAAAGGGA 

GGGGAAAGAGGAAGGGAATTCNNANATNAATTGCTGAATACGGGGATTCC 

ATGGCCTGGATCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAAGGCA 

TCANCTGATGAGGAGCAGCCTGAACTCCGGGGAGGACCTGTTTTTGGTGG 

ZCCGGAAAAAAATGCCTTCCACACACAGGGAGGCCACCCGGCTGATGGGC 

TdGGGGTTGGACGGACAGCCCTAGGACAGGCrTGGGAAACCAGGCTCAGG 

TAGGGCCTGCGAGGTTCTCGCTGCGTCTCTTTCCTTCCTGGTCTTAGAAA 

ATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGGCAG 

ATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCTCTC 

AGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAGACA 

GTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTCAAT 

GAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAATAT 

TCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACTTCC 

TGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGACAA 

AGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCAGTA 

GCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGCAGCAGAACC 

AGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACTCAGTCTCTA 

AGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGAAATAATTCA 

TTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGACTCTCTTTA 

CCTT7CCC7GGGATGGAAGATGCAAATGTTCCTGATCTCACTGTCAAAAA 

AGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCATTAGTCACA 

AGACATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGACTGTGTGC7 

AAACCCGACCTGCCACTTCCAAGGAGTAGATGAGGAATGTCCATGGT7CT 

GGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTCATTTTCTGT 

GGAGGGGGTTGATGGTTAAAGGAACGGCTGGGATTTACTCTTCTTTCTAG 

GGCCAAGAAAATGACATGCTGCCTCCATGTTTAATCATCCTTCCCCCTGT 

TAATAACTATGGCTTTAAGTCCCCGGTTAGGGCCTTCCTCCAAAATTGGG 

GAAAAAAATTCCCCTCCCCCCCTAAAAATTTTTTTTTTAAAAAAACCTTT 

TTTTTTGGGGGTTGGGAAAAAAACCAAAAATTTTTTTTC CC CAGGGGTTT 

TTTAATTTAAATTTCTCCCCAAAAATTTGTTTTTTTTTTTCCGCGAAAAA 

AAGACCCCCCCAAAAAAAAAAAGTTTTTTGGCGGAAAAAAAAATATTTTT 

TTTGTGTTAAGAAATGGAGAAGAAGGGGGGTTTTTTTTTTCTT CTCCCCC 

CACCC3CCAAAGGAAAGGTTGTTCACAGATTGTTTTGTGTCTCCCGCCCA 

>Concig23 

ATGTGCCTGCGAAATCATCCTTCCAGAAATATTTGCCCCTTTCTTTTGTT 

ATAGAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTG 

AACACTTGAAATTGGCTTGTCAGAAT7GCAGTGTAAAGTGTAAAACACAT 

ACCAAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCATT 

AACAATTTTTATATTGACTGTGTAAGTAACATTTTGAATATATTGGATTA 

AATACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACT 

TCACATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCT 

ATTAATGTCGTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGAC 

ATATAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTG 

CTGCATGTTGTTTCACTTAAAGTCAGTGTCCAAGAAACTATCAGTGACAT 

TTAGTGAGGAATTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTAC 

TTAATTCCTCTAAGCCCGGTTTATATCCCTGCAAAGAGAGAAGGATAATA 

ATCAC CAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATG 

AAATGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAG 

CTGCTATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAA 

TGTGCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGAC 

CAGGCTCATC7AAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATC 

AAGGACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGG 

CACACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAG 

CTGTGGGTGGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGT 

GGTTCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAG 

AGAGGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCT 

ACTTTAGAAATATCCACATTTTTGGTAAATTCAAATGATCCAATGATTCC 
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A777CC7AA7G77CAAAAv. f AGCCCCAv. J^CATCTAAATGAATGAlfiA£*w :. 

AATAAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGC 

AACACACACACACACACACAGAACCCATAAGCCT7CATCTTTCCTTGGAT 

AAACGAGCC77CC7G7C7GGCCA777AAG7CACGA77AAG7AAA7GA777 

CCAAC7CGCC7777GCAGCAG77CAGA7GGG7C777CC7GCG7GGCAG7G 

GCCCTCCTGACTTATGATTTCCTGTGTGTCGGCCTGTTACCACTGCAGCT 

TAACTGAGGAAACAAGAACAAAACAGCCTCTGACCCCAAGAGACTGTTGG 

AG&CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCC 

CAGCC CTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGC 

CTTATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTC 

AGGGGTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTC 

CCTGCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGT 

TAAGCAGCCAGGTTTGGAA G TTCTTGCTGTGCAGGCGGGTGTGTGCATGT 

CCTCTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCA 

CGCCCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGG 

GGCCCAGGCCA7GA7GGGAAGCAC7C7C7GCC7AAAGAC7AGGG7GA7GC 

GCCCTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTT 

CCTCCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCAT 

AAAC7GAAAA7AACAAACCCAGGG7C777C7G7CACAGGAAAGGG77777 

TTTTATAAGATTAAACAGAGATGATTCAACACACCCAGGATATAACACAT 

GGGCCA7GAG7CAAGGCCAGGCA77GC7C7GG7CAGCC7G77G77TGGGC 

CCCCTTGGCAGGGCTCTCCCCTGAATCTTCCCCCTCTTGACTCCCCATCA 

C CACAGCACGTCCAGCTTTGGGTACAAGGCCAGTAAATGGGGAAGGGGGT 

CAGATGACA7AAAGAGCCCTTTCCTGTCCCATTGAAATATATTTGGATAA 

CAGATGGCATTTCCCCCTGTGTCTTGCCCAGGGCCCAGAGCCTCCACTTG 

CTAGAGGCAGACAGAGGATGGAGAGCCCCTTCATTAG7GGGAGGACATCA 

CAGGTGGGCAAGAAACCACAAGCTTGCACTGAGGCCCAGCCTTGAAATAG 

CAGCACCTGCCGGCACCTGTGGTCTGGGGACAGGG7CACAGGATGGAGGG 

GCCTCCTAAGCCTTTTATCTCTATGTACTAAGTACAACCCATTTTCCCAC 

CTCACAGAGCCAGATCAGCCTCTGTGAGGTCCTGGTGGCAAAAGGATAAT 

TGCCTGCCCGCCTGCCCGCGGTGGGGTGCTTGTGCTTGCATTCCTGGGAA 

GGTTGTTGGGTTACTCTGCAATAGGTCTCTCTGACCAGCTCACCCTCCTA 

CTGCAAACCTCAAACCAACTTCAAAGAAGATCCAGCACC 

>Contig24 

CGCGTAGTC7AAAGACTGAGTCTGAAGCTGTCCCTTCCTGCTATGGACT7 

CAGATTTTAGCCCACTTGAATTGCTCCATATCCTCCAAGCCATGGCCATC 

CCTTGACTCTCTGGGCTCCCAAGCACTTGCTGCCTTCATCACACAGTTTG 

AG77AAGGCAGAAAGAC7GG777CCA7G7ACAC777G7GGAAGC777C7C 

A777C777A7A7AA7C7C7G7CC777G7C7AC7GC777AAAA7C7AGAAA 

77G777ACAAACACAAAGG7GA7CC777AAAAGC7CAAAGC7GA77G7G7 

CACCAA7A7A7ACCAC7C77AA7GGC77CCCA77AAAC777 GAG7A AAGA 

Z777A7GGAGCC7ACA7AAGGCCA7GAC7ACC7GGC7C77A7777CC7CC 

TCA7CC7CA7C7CACCAACTCACTC7CCAC7CC7A7ACCCC7CACTCC7T 

CCCCCTCC7C7C7C7GAGCTCCAGAC7CCCAA TrAC C7AC TTCCA CCCTT 

777GACCCCCAGGGAC7TATC7CAGCC7GGAATrr7CCC7C7TrGC7C7C 

CAC7GAAC7G7CCAC7CCCAG7C7AAGACA7G7GC77A7G7CACACGCCC 

77ACCG7GC77ATC7CAGT7TGTAA7TATC7AC7CA777AGAAAAG7G77 

GA7GAAGG7C77CAC7G 7CAGC 777 CAGGA 7AG CAGGAA7CA7AGC7 GA7 

TTTACT7ACT7AACGGGGTTTCATTCTTTG7AACTTTTT7TTrTTTrGAG 

A7GGAGAC7CAC7C77GCCCAGGCTGGAG7GCAA7GGCA7GA7C7CGGCT 

CAC7GCAACC7CCACC7CC7GGG77CAAG7GA77C7CC7GC77CAGCC7C 

CCGAG7AGC7GGGAT7ACAGATGCC7G7CACCACGCCCAGC7AA777777 

G7A777777G7AAAGACGGGG77TCA7CA7G77GGCCAGGC7GGTCTCGA 

TCTCC7GACC7CAGGCGA7CCACCCACC7CAGCC7CCCAAAG7GC7GTGA 

77ACAGGCA7GAGCCACGGCACCCAGCCAC7CC777777AC77A7GGG7G 

AGAAGCCA77AGAGA7CA7T7C7TCrm'C777C7C7C77CAC7AAGGCA 

CCAGGG7CAC7AAG7AG7AGGA7AC777GAAC7AGAAC7CAAGAAA77GA 

G7TrrAA7777ACC7CACAC7C7CA7A7GAA77C7CCA7G7GACC7CGGG 

CCA7AC77CCCC7G7ACCC7G777CC7C7777A7AAAAG7AAGAG777AA 

AC7AGA7GG7C7CCGACA7GCA7CC77C7C7AACA7A77C7GGAACC77C 
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AATAAAC7AAGATAAAC ^GAATAATTAAAACTTAATTTAAAAGAAGAv 

SGAAAGGAAGCAGTTACATTAAGCAAAAGAGACATCTTCATGGTTGAAG^ 

AGTGTATGCCCTGGTGTCTGGATCCCATTTAGGAAACTTGGTAACCT7GC 

AATCT7GGGCAGATTGCTTAATTTCTCTAGACCATGACT7CCTCTTCTGT 

AAGATGTGATAAGAACATCTACCTCACAGGTTTCATGAGAGGATTAAATG 

AGATAATGTATTATAATCCCTTGAACATGGTAGGCTGTTATGTTAAGTCC 

TTTCCT CCTT CT CTGTAGCTATCATGGAATTTAAAAACACATTATAACTA 

GAGCATGAGTTGCGACTAAAGGCTCAATTGTCTCTGCATGTGTTGGCTCA 

TGCATGCTTTATTCCTCTGAAGAGCTTTTATACCAAGTGAAAGGAAATAA 

TTGCATTTCCCTGAAAATTCACAGGAAAAAGTTATGTTTTTCTCTTCATT 

CAAGTGATTCTGTTAGACCCAACCACATGGAACAATTTTAAAGTTGCTTC 

CAAATATATTTACAAATATTTCCTGTCTTCAAGGAACAATGGCAAGACCA 

TGACTCAGGTTCACATCCGGATTCCACCACTAACCATGTACCCAATTACT 

TCAGTCACCTTCATTCAGGTCTTACATATCACAGAATAAAATCAGATTTC 

ATCAGAGGAGGTGAAGACAGGGAGAGGAGATATTTCAATCCCTTCTCCGC 

AAC C C G CGTTTTTTTTTTTTTTTTAACAAGGAT CCTAGAGTTACTGAATG 

ATAGCACGTTTGAGGGGGAAAGACCCTAAGGATGATCTTTATAAGCCATC 

ACTTGGTGTTGGTGGTGATAAAAAACTCGAGTATCTTTATGCAGTGGAAA 

GAGAAGATTGGACTCGGAATCAGAAGCTTGAGTTCAAGCACTGGTTTCAT 

CAGTCTTGTGATCTTGGGTTGGTCACTTAACCTCTTCAAGGGTCCTCAGC 

TGTGAAAGAAGATAGTATCAGCTAATTCTTGTATGTGCAGTGAGGAGGCA 

GTGAGATAGTGCAGGTAAACTATAAAACAATTGTCACATGAAACGCATCA 

CAGTGATTCTTTGGACCCACAAGCTCCAATCTTATAAAACATATCCAGTC 

ACCCACCAACATAGATCATCTCACCT7GCATATCTGATTTTGTGGATCAT 

GGGGAAAAACTGCTGATTCCTAGCAAAACCCATGGCATAGGATAAGTGCA 

CAATAATTTTTTTTTCCTAAATGATTTAGATGACAGTGACTCATTAAGGG 

TTTCCTGAGGCCTCCTCAGAGTCGAGAGGTGGGTGCCTGAAGCCACCCAA 

AGTCCCTGTCACAGGATGGCTCCCAACGCACACACCACAGGCCTGCCCAG 

TATGTTCCACTATCTACCCAGTAGAGCCCTGCCCAGTACGTTCCACTGTC 

CCTTCCCTAGAAGAGGTGACTGTTGTTCACAGTCCCAGAAAAGCGGGCTC 

CCCAAAACAATGCAAGGACCCACCTCTCTCTGAACCTCACCCACCCTAGT 

TTTCCTTTAAAAATCAATTTACAAGAAGATCATGTGAAGGAAAAGGTTGG 

GTGATATTCTAACCCAAGTTAGCTGTTTCTCAACCAAGTTCTCTTTGAAA 

AATTCAACAACCACCTTTGGGGAATTATTTACAACAGAGGAGTGAGGATG 

GGACCAGGATAGGTATTGCCTATGTTGGTGGAACCAGGGTTTTTTTCCTG 

GATTACCAAAGAGATGGTATGCATTGCTCCCAGAAGCTAAATATCTTCAG 

GCTTTCAATGGTGGCCTTCACCTGAAAATGTTATCCCTGTTGAAGCTTTC 

AAGCCAGTATTTTCATAAGAACTATATT7TCTTTGGTGAACTGAGGCA7T 

ATAATGATGACTATACAGGTTCTTGAGTGACTGAAGCCATCATTAGCATT 

3TCATTATTTTTGTTTAGTTGCATCTCCATAGCAGCTCACATTCACAATG 

TGC7TTGCAATTGTTCCTTAGCAATAGCCCTCACAAGATTCTCAGGAGGA 

GAGGGTTAATCCGGATTAACATTTCTGTGAAGCCTAGCGAGATTAATCGC 

>Concia2S 

AAGAGT7TTAAAATTAAGTAAGGACGCCGGGAAACAAATCAATCCCAGCA 
AACATTTTGTTGGGATTTATCATTCAAGCAATTTTACAGTTATCCCTGTC 
AAATACATTAAGTGTTCAAAATTGGGCATAGGGGGAACAAAATAATAAAC 

ATTACTATTGGTGTAAGGAAATTAGATACATCTTCCATTATTTAGTAAAA 
TTACCATAACTTCTAACTTTGTGGCTTTAGGCAGTCTAGTCCACAGGCAG 
GAAGGAGGTTTGTTTTGGCAAATGACTGTTATCATCTTCTGTTTCAAAGC 
TAAACCATAAACTAAGTTCCTCCCAAAGTTAATTCAGCATATGCCCAGGA 
ATGAACAAGGACAGCCTGGACGTTAGAAGCAAAATGGAGTCAGGTAGGTC 
AGATCTTCTTCACTGTCTCAGTGATGGCAGTTTCATAACTTTAAATGATG 
GCTATCACAGTTTTCATAAATAATCTAGATAAACAGTTAAAATAAAATAA 
TTAGGTAAATGTAGTGCGATAAATATTAGTAGACAAACTCACCATAATTT 
AGAATCTAAAGTTAAATTAAATAATAATATTTCATTATTTGGTATTTTCC 
AAGAAAAACATATTGTAGGAAACCATTCTTTTTAAAAAAAAAAGTGTCCT 
TTTAAAAAGGTGAATAATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTA 
TGTATAAAACAAGGTAAAAGGAACAAGGAAATAAGGGAAATGTAAAGAAA 
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ATTATAGAAATAAAGTCjw fATTTTTTGGTAAGAAAGCTTAAAGXGWSA-*^- 

ATTTTAGGTAAGAAAGAATCTTACCTAAAATTTTGTGCTAGAATAAAGTG 

ACTGGCTAAGAAAGGGATGTTCAAAGCTATTTATGACAAACCCACAGCCA 

ATATCATACTGAATGGGCAAAAGCTGGAAACATTC C CTTTGAGAACTGGC 

ACAAGACAAGGATGTCCTCTCTCACCACTCCTATTCAACATAGTATCGGA 

AGTTCTGGCCAGGGCAATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAA 

TAGGAAGAGAGGAAGTCAAATTTTCTCCGTTTGCAGATGCATGATTGCAT 

ATTTAGAAAACCCCATCATTTCAGCCCCAAAACTCCTTAAGCTGATAAGC 

AACTTCAGCAAAGTCTCAGGATACAAAATCAATGTGCAAAAATCACAGGC 

ATTCCTATACACCAATAATAGACTAACAGAGAGCCAAATCATGAGTGAAC 

TCCCATTCACAATTGCTACAAAGAGAATAAAATACCTGGGAATACAACTT 

ACAATGGACATGAAAGACCTTTTCAGGGTGAACTGCAAACCACTGCTCAA 

GGAAATAAGAGAGGAAACAAGCAAATGGAAAAACATTCCATGCTTATGGA 

TAGGAAGAATCAATATCGTGAAAATGGCCATACTGCCCAAGTAATTTATA 

GATTCAATGCTATCCCCATCAAGCTACCATTGACTTTCTTCACAGAATTA 

GAAAAAACTAATAGCCAAGACAATCCTAAGCAAAAAGAACAAAGCTGGAG 

GCATTGTGCTACCTGACTTCAAACTATACTACAAGGCTGCAGTAACCAAA 

ACAGCATGGTACTGGTACCAAAACAGATATATAGACCAAAAGAACAGAAC 

AGAGGCCTCAGATATAACACCACACATCTACAACCATCTGATCTTTGACA 

AACCTAACAAAAATAAGCAATGGGGAAAATAATTCCCTATTTAATAAATG 

ATGTTGGGAAAACTGGTTAGCCATATGCTGAAAACTGAAACTGGACCCCT 

TCCTTACAACTTATACAAAAATCAACTCAAGATGGATTAAAGATTTAAAC 

ATGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC 

GAGATGGGTGGATCATGAGGTCAGGAGATGGAGACCATCCTGACTAACAC 

AGTGAAACCCTGTCTCTACTAAAAAATACAAAAAATTAGCTGGGCATGGT 

GGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGG 

TGTGAAACCAGGAGGTGGAGCTTGCAGGGAGTGGAGATCACGCCACTGCA 

CTCCAGCCTGGGCAACAGAGTAAGACTCCATCTCAAAAAAAAAAAAAAAA 

AAAAAAAGAAGGATTTAAACATAAGACCTAAAACCATAAAAACCATAGAA 

GAAAACCTAGGCAATACCATTCAGGACATAGGCATGAGCAAAGACTTCAT 

GATTAGAACACCAAAAGCAATTGCAACAAAAGCCAATTGACAAATGGGAT 

CTAATTAAACTGAAGAGCTTCTGCACAGCAAAAGAAACTATTGTCAGAGT 

GAACAGGCAACCTACAGAATAGGAGAAAATTTTTTCAATCTATCCATCTG 

ACAAAGGGCTAATATCCAGAATCTACAAGGAATTTAAACAAATTTGCAAG 

aaaaaaaaacccatcaa; AGTGGGCAAAAGATATGAACAGACACATCTC 

AGAAGAAGACATTTATGI GGCCAACAAACATGAAAAAAAGCTCATCATCA 

CTGGTCATTAGAGAAATGCAAATTGAAACCACAATGAGATACCATCTCAT 

GCCAGTTAGAATGGCGATTATTAAAAAGTCAGGAAACAACAGATGCTGGA 

GAGGATGTGGAGAAATAGGAATGCTTTTACACTGTTGGTGGGAGTGTCAG 

TTAGTTCAACCATTGTGGAAGACAGTGTGGCAATTCCTCAAGGATCTGGA 

ACCAGAAATACCATTTGACCCAGCAATCCCATTACTGGGTATATACCTAA 

AGGATTAGAAATCATTCTATTGTAAAGACACATGCACATGTATGTTTATT 

GCAGCACTATTCACAATAGCAAAGACTTGGGAACAACCCTAATGCCCACC 

AATGATAGACTGTGTAAAAAAATGTGGACGTATACCCCATGGAATACTAT 

GCAGCCATAAAAAAGAATGAGTTCATTCTTTTGCACGGAACTGGATGAAG 

CTGGAAGCCATCATTCTCAGCAAACTAACfiCAGGAACAGAAAACCAAACA 

CTGCATGTTCTCACTCATAAGTGGGAGTTGAACAATGAGAACACATGGAC 

ACAGGGAGGGGAATGTCACACACCAGGGCCTGTCAGGAGGTGGGGGGCAA 

GGGGAGGGATAACATTAGGAAAAATACCTAATATAGATGACGGGTTAATG 

GGTGCAGCAAACCACCATGGCACATGTACACCTACGTAATAAACCTCCAT 

GTTCTTCACATGTATCCCAGAACGTAAAGTAAAATTTAAAAAAGAAAGAA 

AGAAAGAAAAGGATGTTCACGACAAACCAGAAAGTCCAAGCATGTCATGA 

ATAGTCTGTGTAAGTCACAATAAGAGGATTTATTTAAAA AAACT TTTATA 

TGATAAAGTTGTCTATAATTAAAGGGAAATTATAATGGTCTTTCTAGAGA 

TTGGGTTGATGTTAAAAAACTACTTATATATTAAAAAATTGGTTAGAACA 

ATGAAATTTTCTTACGGGGTTGATTCACTCTTAATAAATTATAAGAGACT 

TAAGAA'i " ! I'ITT TTTAACCCAAAGTTCAGCT7TTATTGCATCTTGCTGTT 

TTAGGTTTTCTCTCCCCTTTAAAAGGGTGGGAAATAGTAATGCCCTCCTT 

CAACTCCCTTCAGCTCATATACGT7TTTTACCCTCAGATTCTGTTTGTTG 

TGTCCTGATGCTAACAATGTTTTCTTAAAGGTCTAAAGGAAATGTTTTCT 
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7CCAACA7AA7A77C7G1 VJCA77GCAGAAGG7C7777C77T7GCC77TCU 

G7AAC7GGC7 7AAC AGA7777A7G7777A77GAAA7AA777C7A7GCCA7 

TA77A77AAG7777GG777GC77AGAAAACAC7GAGA77AA7ACAA7T77 

T7AAAAA77A7GA77A77ACA7CCA7A7A7C777A7G7A7G7GC7777AA 

AG7CC77G7GACA77GAG77C7AGGGC77GAC7CC7GGG7C77AAAAGGA 

CAAG7CC7GC7AAA7C77AAA7AC7GACAGCAA77AAAGGC7CA7C77CA 

GGACTGGTAGAAAATGCCAATCAAAATAAACTGCATTCTTGAAA'CACAGA 

GCCAGAAA77AAAGC7A77CAAC7CAAGGCCCAGGAAC7A7AG7GGAAGA 

GGTGGG7GTGTGAGAT7GTAAGGGCCAATTTTGAGAGATAAAATAAGTTC 

AA777C7C7A7AAA77AA7CA7AA7CA77GA7G7CCAAGCCACAC7GA7G 

CAAGA7CAGCA7A7GGG7CC7G7G7CAGA77AACAAGG7777C77GAAGC 

A77AACC7AC7CC77AA7AAAGG77A7AGAGG77A7AAAAGGC77C7GGA 

AG77A7AGC7A7GG7CAAGA7AAAAA777CA7AGA77G77AA7ACAA777 

7GGAAAACAAA77TAA77GGC77C77GC7G77777A77AGGGC77A77G7 

77GGAAAA77AAG7C7CG7C7C7CAAAGAA7GAAGGC777CACC777777 

777777777777AA7CC77GAG77A7CAC777GG7CAAA7GAA7GAC77A 

T777ACAA7GACC777CA7CAAG7G7777AAACC777CAAA777GACAAA 

C777CCAAAA7CAAAC7ACAAA77A7G7C77777A7GACC7AA7GAA7CC 

T77AAAA7AC7AGG77CCC7AAAG7CCAAAAAAAAAAAAAAA7AACA7AA 

7G7GGC77ATT7GG7A7AAAAA7777ACAAGAAACA77G7CAAA7A7AAA 

A7A77G7G7GG7777G777GGGC7G7A777G7A7AAA7A7G77AT2GG7A 

7G7G77CCAAAA77A7AGGAAAC7CC7A7AA77C7GA7A7GAC77GG7G7 

ACA77A7CAG7AA7AA77A7AA77G77A7GG7AAA77A77G7G7GCCATG 

GAGG7AACAAA777CC7CA7CAAG7G7G7C777GAC7A7GG77GCCC7AA 

AAC77777GCCA77CACAGACAA77G7C77GC777GG7CC7C777AGAAG 

G7GG7777A7AA7CAGCTATAAAAC7C7AACGGG7GCTC77GAA7GCAGG 

C77AAGA7AGC777GGAGAC7G7GACATCAGAA7AGAGGAAAAAC7TrCA 

G7A77CA7GGAG7GC7GAAA7A77CA7GAA7A7CAAGCAAAACAGGAA7T 

AAC77CA7AGA7GGAAC7AAAAGAA7GC7 GAAG 7AA7 C T 7 7TrGAC77TT 

T77C77AGAA7G77GA7CC77CG7777G77777CAGAG7CNAGGAAA777 

77C7G77GAGA7A77GACAGC777AACAA77AAG7A7AC7CCAG7GAACA 

CAA777GG AG CA 

>Concig26 

A7C7AG7CA77CCCCAGCC7GACCAA77CAA7GGCCCCCA7C77AG77AA 

AA77CC7CACCC7GACAAGGCCCCATC7ACGCC7C7GACC7CATGCCCTC 

CAC7C7CAG7C77GCAC7CACCC7GCCACAC7CAAGGGC77CCCCAGGTT 

CC77C77AGA77CCACCGA7AGC7CAGGGAC777GCACA7GC7ACGG7C7 

C7GCC7GGC7CC7CCCCAGA7C77C7CA7GCC7AGC7GC77C7CA7CAGC 

ACCCC7CAGAGAC7G7CCC7GCCCCACC7C7CCAGG77CCA7ACC7GCCA 

CCC7CCCCCAA7CACG 7AACAG T7TC77CACAGAGCGAG7TACCATCCCA 

G7A777C C C7 AAC77 A777777G7G AC7GG7C7G77GC C7G7C7CCACCA 

CAAGAACA7AAGC7GCATGTGAACAGGAGCCT7G7C7A7CTTG7CACCCC 

AG7GGC7G7GACA7AACCTGATACACA77AGATGCTCAATGATG777GA7 

GAA7GAAG7GC7GG7AGTCCAACTGTGTTTCCTTGTCTGTGTAAG7A7G7 

C7G77G7GG7TTCCTAAGAACCTACAGCTCTCCCAC7GTGACTCCTGTTC 

7A7GG7CC7GA7T7GCTGGACTAGAA7CCTAACC7ACATGCTTAC7CTTA 

G7G7CC7CCCCCAGAGGC7GAA7CCCAG7CCC7AAACC7CCACCAAA7GG 

C7AAGACC7AGC77CCAACCAGACAGGCC7ACGC7GAGACC7CAGCACCG 

CCC77C7GCGG7C7CATCCTTAACG CA7C C77CAGGGCCCAGC77AAATG 

TC7C77C7CCAAGGAAGGC7A7CC7C777C7GCCCC7CAG7GC7C7CCA7 

GCC7CC7C7A7GCC7CCA7GCC7GC777CAACCC7GCAGAAGTGGAGAAA 

77GC7AA7C7GCTG7G77GACAC7G7GC7GGGG7GCC7TGGGCCAGGGAG 

CAGGC7GG7GG7G7GC7GATAGCCCG7GGCTG7GCCCAGG7CCA7GCTCA 

C77CC7GAGCCCCAG7GGAGTAGGC7CCCTrTCCCTTAT7GCAGCACTCA 

GAGGAAGGACG7GC77C7TAGGACAGA7C7GGCCAACC7C7CCC7CGTGA 

GAGAAGGCCCAGCCA7CC7CT7GCCC7C777C777C7CC7GCCCCCGAG7 

AA7AAAGG7GCC7GG7CAGAGCC7TC7AGAAGGAGACCCAAACATCCACC 

ACACA77CCCAG77CCAACCG7CA7CCACATGGC7GGCTG7GCAGG7AAA 

CGCAGAG7C7G777CACACACCCAACCA7C7AG7A7TGGA7GGGAGGACA 

G7AGCG7GACAC7C77C7CCAGCC77GAGCCC7AC7G7GGGCCCCACCCA 
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ACCCAGATACCAGAGC^JCCCTGTACTGGGATGCTATTGGATGCTTG*i^C :;: 

AG7CA7G7ACAAAGTTAGCCC7TTG7TA7A7AGAGT7AGC7ACG7ACATC 

TTCCrCTGTAGGGAACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTA 

ACCTGCAAATCCTCTGCTGAGCACCCTGCACTACATACAGTGGGTAGCAT 

G7GG7AGG7GC7CAATAACTATTGACCGA7AGA7TGAA7ACAGG7AGGA7 

GGTGACACAATCTAAGATCCCAGGGGTGGGGAGACCACACGCTTGGTTAG 

G3AGACCCAAAGTGGACCGTG7GGCCAGAAGAG7CCCGCAC7GCAC7C7A 

GTGACAG7GCAGAAAG7CAC7G7GGGAAA7C7AGAAG77TCTACAGG7TG 

CTATTTCATCATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTG 

TTGGGAAAAGCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTT 

CCACTCTCTCACCTCCACCCAGGCGTCCAGGTGTGGAGGACACTTGCCGG 

CTGCTTGTCT GCTG ACTCATCCCTTGGTTTCACTTGGAAAACCTACCACC 

AGCTGGCCTCTTTCCAAGCATCAGCCTCCTC A TTTT CT ' l 'AATCCCTTAGG 

TGTGATCTCACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATAT 

GAA7AAAAATGA TTATT TTGTCATCTTCCAATCTTCCTTTTAAAATATTA 

77TTA7AATTCCCTTTAGGAGGATCACCTAAGTGAAGACTAXTTTTACCT 

AAGAAA7G77AAAATGTAAAGACA7GGT7G7AATC7GGGGAT7CCTGTTA 

AAA7GGC7AGCAQACAG^J\GTCAGACGACAGGCTAGAAA7GTGTGAAGAG 

7GG77GCC777GAAAGGCGGAGTTGG7AA7GA77T7C77CCA77777CCA 

7GC777CCAA77CTCTACAAAGGCCTrAA7A77AC77CGA7AACCAGGAC 

C7C7GA7AACC7GCCCCCACCGAG7AAAGAC77AGC7GGGAAAGTCAGC7 

7CA7G7GAGG7AAAAGGAACCAGGTAATACACAA77CCCACTGCCAACTG 

7CGGG7G7GCAGGCCTGAGC7TCC7GCA7G7GGGAGGAAAGAGAAAGAAG 

AGAGAAAC7CCAAGATCCAAGAGATCCAGCAAGAAGGC7GGAG7C7GAGG 

ACGCAGAAAGC7GAATGGCACAG7TACCAC7ATTG7GC7GAGG77C7G7G 

GCC7C7GGG7C7CTTGACAACTGGGCAAAGACCCACAGAAAAC7A7CTC7 

AGACCC7ACC7GTGGGAGGGGAAAG7GCTTAAGA7CATTTACAGGACAGC 

CACC7GGACCTCAAATGGCTTACAG77CC7TCATCCAGAGGGTCTTCATT 

7AG7ACATACCAGGTGCTAAGC7GGGTGC7GGAGACATGACGGGGAACCC 

ATTTACCATGGCTTTGTTACTGTGACATTCACA7C7AGGGAAAGCCAGCA 

AAGGGGAGGGA7CGAGGAGAGCT7G7TAGGCAGAGAAAA7ACCCAAGGGC 

AAGGGAGAAGCCAGCCTGTTCTGAGCACACACAGTGG7TCCATCTAACTG 

GGCCTCAGTGCCAGGTTGGACTGGAGATGGGGCTGAGGAGCTGTCACAGA 

GCA77C7GGACACAGATGTCACA7AGTCCCTTGAGGT7AGGGTCCTTAGG 

CA7GGCAGCA77GCTTrGAG77Tr7CCT777G7AA7G77GCCA77CA7GA 

CAA7G7GGAAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGT 

7AGGAGAC7AAGATGTGAGCCAAGGAAAA7GAGGAACACCTGAACACTGG 

GGCAGG7GCAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAG7 

AGG7AGAA7CAAGGCAGCTGG7ACAGA7CT777A77ACA7A7AAAC7GGA 

A7AAGCCA7C7GTTCCAAGACAAAAGAGTAGGCGGAAAACAA7ACAAGAC 

AGAAA7GGAA77AGAACAAACC7GGGAGGAA7G7GGAA77AGAG7AGAGA 

G7CCAACAC7GGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCT 

AGG7G7GC7TACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGT 

TA7GGC77AAA7GTGTCCCCCAAAATGTGA7G7GTTGGAAACTTGA7CCC 

CAA7GCAACAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATA 

AGGG7TCTGCCCTCATAAATGAATTAATACTGTTATCATGAGAGTAGATT 

CCTGA7AAAAGGATGATCTCTGCCTCCTCCCCACAGCCCTCTTGTGCATG 

C7TTCC7GCCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCC 

7CACCAAA7GCAGCTCCTTGATCTTGGACTTTCCAGCCTCCAAAACTGTA 

AGCCAAACAAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTG 

7TCTAGAAACACAAAATGGACTAAGA7CATTAAATTATCATTTTTTATCA 

GACTG77GA 

>Cont:ig27 

AAAA7A7AACAGAGAGTAAGAGGAAAA7TAC CT T ClTr C7T7T7CCTTTC 
CC7GCC7GACC7TAT7CACCTCCCATCCCAGAGCATCCATTTATTCCATT 
GATC777AC7GACATCTATTATCTGACC7ACACAATACTAGACATTAGGA 
CAA7G7GGCC7GCCTCCAAGAAACTCAAATAAGCCAAC7GAGATCAGAGA 
GGA77AA7CACCTGCCAATGGGCACAAAGCAACAAGC7GGGAGCCAAGTC 
CCAAAA7GGGGCCTGCTGCTTCCAG77CCCCTCTCTC7GCA7TGA7G7CA 
GCA77A7CG77CGTCCCAGTCCTG7CTCCACTACCAC77TCCCCCTCAAA 
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CACACACACACACAACA^JCTTAGATGTTTTCTCCACTGATAAGTAGGT^ • 
ACTCAATTTGTAAGTATATAATCCAAGACCTTCTATTCCCAAGTAGAATT 
TATGTGCCTGCCTGTGCTTTTCTACCTGGATCAAGTGATGTCTACAGAGT 
AGGGCAGTAGCTTCATTCATGAACTCATTCAACAAGCATTATTCACTGAG 
AGCCTTGTATTTTTCAGGCATAGTGCCAACAGCAGTGTGGACAGTGGTGC 
ATCAAAGCCTCTAGTCTCATAGAACTTAGTCTTCTGGAGGATATGGAAAA 
CApACAACCCAAACAACCAACAAAAGAGCAAGATGCTGCAAAAAAAAAAA 
AAATGAATAGGGTGCTAAGATAGAGAAAAGTGGGAGAGTGCTATTTAGAC 
AAAGTGGTAAAAACAAAGCCCCTTGTGAGATGAGAGCTGCCGACAGGAGG 
GGGCGGGTCATGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGC 
GGGTCGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGT 

cgtggttgtgggtttttgggtaggacattcagaggagggggcgggtcgtg 
gttgtgggtttttgggacattcaaaagagtctgaatgcacccaggcctac 
aacttcaagatggtaaaggacagctccaaggatcagaagaagcatgcttg 
gaactggggcattttgagaaggaggaaaaatatgcagagactagtgcttg 
cagagcttgcatgtggatttcatttgaggtacaatgaaaacccattaatg 
ggtttcacac^gtgcaatcxk:ctgacctcacttatatttcctaaaataga 
aaacagatcagaaggaaggcaatagagaagcagaaagtccaatgaggagg 
tttcacagcagtcatgggggtggggtaaggaaaagaagtggaaagaaaca 
gacagaattgggttatattttggagatagaaccaacagaaggaagaggag 
aaacaacatttactgagaagggaaaaagtaggagaggaataggtt7ggga 
aataaatcctgctgacattggaaaccccaaggaagcctcaaaagtatatt 

TACTTGCTTTAGATTTAAAAGAATAGGAAAGAAGCATCTCAACTTGGAAT 

TTGAAATCTATTTTTCCATAAAAGTATTGTTAAAT7CTACTCATACTCAC 

AAGAAAAG7ACATTCTAAAGAGTATATTGAAAGAGTTTACTGATATACTT 

AGGAATTTTGTGTGTATGTGTGTGTGTGTATGCGTGTGTGTGTGTTTAAC 

CTTCAATTGTTGACTTAAATACTGAGATAAATGTCATCTAAATGCTAAAT 

TGATTTCCCAAAGGTATGATTTGTTCACTTGGAGATCAAAATGTTTAGGG 

GGCTTAGAATCACTGTAGTGCTCAGATTTGATGCAAAATGTCTTAGGCCT 

ATGTTGAAGGCAGGACAGAAACAATGTTTCCCTCCTACCTGCCTGGATAC 

AGTAAGATACTAGTGTCACTGACAATCTTCATAACTAATTTAGATCTCTC 

TCCAATCAACTAAGGAAATCAACTCTTATTAATAGACTGGGCCACACATC 

TACTAGGCATGTAATAAATGCTTGCTGAATGAACAAATGAATGAAGAGCC 

TATAGCATCATGTTACAGCCATAGTCCTAAAGTGCTGTTTCTCATGAAGG 

CCAAATGCTAAGGGATTGAGCTTCAGTCCTTTTTCTAACATCTTGTTCTC 

TAACAGAATTCTCTTCTTTTCTTCATAGGAGATGCCTGAGATACCCAAAA 

CCATCACAGGTAGTGAGACCAACCTCCTCTTCTTCTGGGAAACTCACGGC 

ACTAAGAACTATTTCACATCAGTTGCCCATCCAAACTTGTTTATTGCCAC 

AAAGCAAGACTACTGGGTGTGCTTGGCAGGGGGGCCACCCTCTATCACTG 

AC7TTCAGATACTGGAAAACCAGGCGTAGGTCTGGAGTCTCACTTGTCTC 

ACTTGTGCAGTGTTGACAGTTCATATGTACCATGTACATGAAGAAGCTAA 

ATCCTTTACTGTTAGTCATTTGCTGAGCATGTANTGAGCCTTGTAATTCT 

AAATGAATGTTTACACTCTTTGTAAGAGTGGAACCAACACTAACATATAA 

TGTTGTTATTTAAAGAACACCCTATATTTTGCATAGTACCAATCATTTTA 

ATTATTATTCTTCATAACAATTTTAGGAGGACCAGAGCTACTGACTATGG 

CTACCAAAAAGACTCTACCCATATTACAGATGGGCAAATTAAGGCATAAG 

AAAACTAAGAAATATGCACAATAGCAGTTGAAACAAGAAGCCACAGACCT 

AGGATTTCATGATTTCATTTCAACTGTTTGCCTTCTACTTTTAAGTTC 

GATGAACTCTTAATCAAATAGCATAAGTTTCTGGGACCTCAGTTTTATCA 

TTTTCAAAATGGAGGGAATAATACCTAAGCCTTCCTGCCGCAACAGTTTT 

TTATGCTAATCAGGGAGGTCATTTTGGTAAAATACTTCTTGAAGCCGAGC 

CTCAAGATGAAGGCAAAGCACGAAATGTTATTTTTTAATTATTATTTATA 

TATGTATTTATAAATATATTTAAGATAATTATAATATACTATATTTATGG 

GAACCCCTTCATCCTCTGAGTGTGACCAGGCATCCTCCACAATAGCAGAC 

AGTGTTTT CTGGGATAAGTAAGTTTGATTTCATTAATACAGGGCATTTTG 

GTCCAAGTTGTGCTTATCCCATAGCCAGGAAACTCTGCATTCTAGTACTT 

GGGAGACCTGTA ATCA TATA ATAAA TGTACATTAATTACCTTGAGCCAGT 

AATTGGTCCGATCTTTGACTCTTTTGCCATTAAACTTACCTGGGCATTCT 

TGTTTCATTCAATTCCACCTGCAATCAAGTCCTACAAGCTAAAATTAGAT 

GAACTCAACTTTGACAACCATGAGACCACTGTTATCAAAACTTTCTTTTC 
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TGGAA7G7AA7CAA7G1 . rCTTCTAGGTTCTAAAAATTGTGATCAGACv-A 

TAA7G77ACA77A77A7CAACAA7AG7GA77GA7AGAG7G77A7CAG7CA 

TAACTAAATAA AGCTT GCAACAAAATTCTCTGACACATAGTTATTCATTG 

CCTTAATCATTATTTTAqTGCATGGTAATTAGGGACAAATGGTAAATGTT 

TACATAAATAATTGTATTTAGTGTTACTTTATAAAATCAAACCAAGATTT 

TA7A7777777C7CC7C777G77AGC7GCCAG7A7GCA7AAA7GGCA77A 

AGAATGATAATATTTCCGGGTTCACTTAAAGCTCACATTACACATACACA 

AAACATGTGTTCCCATCTTTATACAAACTCACACATACAGAGCTACATTA 

AAAACAACTAATAGGCCAGGCACGGTGGCTCAGACCTG7AATCCCAGCAC 

TTTGGGAGGCCAAGGTGGGAAGATCACTTGAGGTCAGGAGTTCAAGACCA 

GCCTAGGCAACATAGTGAGATCTCATCTCTACAAAAAAAAAATGAAAAAT 

TAAAAAATGAGCTGGACATGGTAGTACACACCTGTAGTCCCAGCTACTCG 

GGAGGCTTGAGGTGGGAGGATCACTTGAGCCTGGGAGATGGAGGCTGCAG 

TGAGCCATAATCACACCATTGCACCCCAACCrGGGCAACAGAGTGAGACC 

C AGTC TCAAAAGATAAATTTTTAAAAATGTTAAAAAATATATAAAAGAGA 

ATTTTAAAAGAACAACTAATAGATCAAAGCATGGATGCAAGATATATTTA 

GTTGGAAAATCAAGGTTAAAATCAAGGGATCTTGGAATTAGGTGTGGTAG 

ATTTGGGTAAGGAGTAGTCTAAGATGACCCTGTTTCTTGGTACTGGAGAC 

TGGATGAGTGGCAGCGTCTTAACCATATTTTTGGTAGAAATATGGAGGTC 

TTCTCCATTCCAGGATGAATGATGAGTAAAATTTTAGGCATGTAATTTGA 

GC7AC7AGAAGGACAC7CAA77GCAGA7G7ACAA7GGGGAGA7GATAACC 

TATCTGGAACTCAGAAAAATAACTGTATATAGATATGAAAGACATCAGTA 

GGTATGTAGTAGATAAAATCCTAAAAGTGATG7CAAAGGGAGAAGAGAAG 

TATATGGTGAACACTGTTGTTTGTCCATGCAATTGCCATC7CTTCTTCTT 

CCTTACTGACAGAACCCTGATTTCACTGAGAAGTCAACATGCCCT7CCCC 

AATTGATGAATCCAATTGGTTGAAGATTATGTTCATTCTATTCTTACATG 

ACTAAGTCACGTTGACTTAATCCTATCAAATGAGATGTCGATCTGGAAAC 

AACTTCTGGAAAAGATTTTCTACCTTGATAAAATAAAGAGCCATATAGAT 

GGTCCTTTATCTTCCTTCTTCCTTGAATGAGATATGTTCTATGAGGAAGT 

GAAGCTTAGAACTGTGGTCAGCAACTTGCAACGACTGGGAAGTCAGAGCC 

ACACAATGAAGAATGCAGAGTGGAAGGAGAAAAAGAGCCAGCATCTCTGA 

CAACATTGTTACACCGAGAACCTACCTCCAGATTTTAAGAAAACAAGAAA 

TGCTACTGTTATTAAGCCATTTCACTGGGTTTGCTATGACTTGCAGTCAA 

ATCTAGCTTAACTGATACAGAGCACCACAGAGAACTGGTCTCTCATTTGT 

CTCATCCTGTTCTTTCTAGCAGCCACGACTTTCCTAGGGTTTCCTTAGCC 

CAAG7C7GGC7AGAGCAAGAC7AAG7AAGAC77GA77CC77AA7G7CC77 

77G7777AAGAAA7A77AAAGAA77A77777A7A77AA7A7A7777AAGA 

.\ATAAGGAAATACAAAACACTGAGCAAGCAACACAAATTCAAGAAATC77 

AAAAAG7A7AA7AGC7GC7CAG7C7C7GA77AACAG7GAAA7A7GGAA7C 

ATTGTAGAAATGGCCTTGGAGCGTTATTCTCCCAGGCCAGCTATCCTTAT 

GGTCTGCCCCACCTCCCTCATTGCCTAAACAGTAAGAGAGTCCCATGGTG 

AGACTCAACAGTCTTAGCACAGAACTTGTTACAGTCTATTTCTTTTCTTA 

CAGTCCTATATATCAATTCCAAATCAATGAGAGTAAAGCCCAATCCCTGC 

CTTTAAACCCAAAGGACAGAAGCCCAAAGCCCAAAGATATTCCCTAACCT 

7CTCCCCCT 

>Contig28 

CC7G7CGC7CCC7A7GT7TAAAGC7GGGGA7C7C77777CC7G7G7C7AA 
77A7777CC7CA77GGCTTGAAAAA7C7GA7AAAACA7777AGGAC7G7G 
TA7AAAA7AGAA77AGCCAAG7GCAA7G7C777ArrCAGAAGAAA777CA 
7GGACG77G7GCC7AC7C7C77GGC77CC7GGC77CA7GGC777CCAGA7 
CCCACAG7AAGC7C7GGA7AG7AGAAG77A7AG7AAGAC7GACTTC7AAA 
7AAA7GAAG7GAC7TTAACC77AC7GA7A7GGC77AAAGAAAAGGAG7GG 
CC777AAGA7CCA7GAAC7TC7CAAACAAAAG7GA7AACG77A7C7CCA7 
GCA7A7A7AA7AC7AAA7A7AA7GCAAC7GAGAGAAG7AGGC7G7GG7AA 
GAAAGGAGACCCAAG7GCCA7C7GAAGGCAGCAC77ACCAC7C7GC77CA 
7CCCACCGAGGAAACAAAGCA7GAG7AT7GCCAGATTTTCT7C7GTTrCA 
AGAAAAGCCAGAAA7CCAGG77777GCG7GAAA7G7CC7GA7777AA7G7 
7GGGAAC7AA777A7A7777GAAA7AACA77G7G7GGGACAAG7GAAC77 
37A7G7GGAAC7GC777C7CCCAG7GGCGACCAG777GGACCG77GA7AC 
7CAGCAAG77CAGCCAAG7GCGCC77G7CA77G7CAG7CA7CAAGG7GA7 
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37G7GA± i. GGTCAAACAnXTAGTTTTGCTCAGCATCTCGTGTGTTTTCAA' 

AGGACCTGAGGGTTCATTTGCCCATGCAGATCrTGTAGTCCTGTTTATTC 

7ATTAA7TTATCTTGCAAATCTATAATGTTTTATTTTAAGCAGCGAGAGC 

CG7GGCAGCC7TTGG7C7GGACCC7T7C7AA7GA7CA77TAG7A7CAGGC 

TATGTGGGAGTTGATTGTTTTGCATTGCCTGAAAGCCAACAGTATCACTC 

C7CC7C7AGG7G7GGCAGAGA7G7GAGAGAGGGAGAC7GACAG7C7G7GG 

GZGTGTATGCAGTGTTGGGGGAAGCGAGGCACAGGGGACAATACTGTGGT 

GTATAAAACTAGTCTAAGGTAGCATCAGGAAGTTCATGAAGCCAAAATGA 

TTTTCATAACAGCACAAGACATTATTTGTTTTTGCCTCCCTCTCATTTTT 

T777777777GAGACAGAG7C77GC7C7G7CA7CCA7GC7CG7G7GCAG7 

GG7GCAATCTCGGCTCACTGCAACCTCCACCTCCAGGGTTCAAGCAATTC 

TCATGCCTCAGCCTCCTGAGTAGCTGATTACAGGTCTGCACCACCCCGCC 

GGCTAGTTTTTGTATTTTTAGTAGAGATGGGGTTTTGTAATGTTGGCCAG 

GCTGCCCTGTCAT 1 1 i mTTTACTAGTGTCCAGTGGAGTTTTTTAGGGG 

CTACATAACATGATACTGTCATTAATCTAATGGCTAATGAAAGGGATATG 

TATATGTTTTTGTGTTTAAAACAAACTTCTTTGGGGTCCTCAATAATTTT 

TAAGAGTATAAAGGGGTCCTGAGATCAAAGAGTTTGAGTTCTGCTGGACT 

GGGACAGTGGTTGTCAACCCAGATTGTACATTAGGGTCATCTGGGAAGCT 

TTAAAATAGTACTGATGCCCAACCTTACCGCAAACCAATTAAGCCAGAAT 

CTCTGTGGATGAGAAGTCTTCATTGTCATCATCACCATGACCATCATCAT 

TGTCACCGTCACTACACCATTATCATCATCATCATATCATCTTCArTATC 

ATTGTTAGTATCTCCATCACCATCATCAGCATCACCATTATTATCATCAT 

CA7CATCCCCACCA7CA7CC7CA7CGGAAC77CACC7GCA7GGAGGACAA 

TCCACTATGCATTAGGTGCTATGCTATTTGCTATACTCCTTATTCTCACA 

ACTG C Z CAGAGAGGCTGATATTATCTCACTTTATAACAGGAGGAATCTGG 

ATCGGAAAAGTTAAGGTAAGCTAATTCACAGAGCGAGAAGAGATAGAGCC 

AGGATTCGAAACCAGTTCTCTGCTACATCAATGTTCCCAGTCCTTGCACT 

ATTGAGAACCTCTTTAGTTATGCTTTCACCCCTCCAACACCACAGTAAAT 

TTTTTCTTTTTTTAAAAAAATTATACTTTAAGTTATAGGGTATATGTGCA 

TAATGTGCAGGTTTGTTACATATGTATACATGTGCCATGTTGGTGTGCTG 

CACTCATTAACTCGTCATTTACATTAGGTATATCTTCTAATGCTATCCCT 

CCCCGCTCTCCCCACCCCATGACAGGCCCTGGTGTGTGATGTTCCCCACC 

CTGTGTCCAAGTGTTCTCATTGTTCAGTTCCCACCTATGAGTGAGAACAT 

GTGG7GTTTGGTTTTCTGTCCTTGTGATAGTTTGCTCAGAATGATGGTTT 

CCAGCTTCATCCACGTCCCTACAAAGGATATGAACTCATCCT T TTTTATG 

GCTGCATAGTATTCCATGGTGTATGTGTGCCACATTTT C TTAATCCAGTC 

7A7CA77G CTGGACATTTGGGTTGGTTCCAAG7CTTTGCTATTGTGAATA 

37GC CACAGTGAAC ATTCATGTGCATGTGTCTTTATAGCAGCATGATTTA 

7AA7CC777GGG7A7A7ACCCAG7AA7GGGA7GGC7GGG7CAAA7GG7A7 

77C7AG77C7AGA7CC7TGAGGAA7TGCCACAC7G7C7ACCACAA7GG77 

3AATTAGTTTATAGCCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCA 

CATCCTCTCCAGCACCTGTT G TTTCGTGA C TTTTT A GTGATTGCCATTCT 

AACTGGCACCACAGTAAATTTTTATAGATTTTATAAGCAAATTGTATTTA 

CTGTGCAAGAATTGGTTTATTTTTTAAACCATGTGTTGCAAACATACAAT 

GGTTAATTGTGATATTTGCTCAGTACAAGATCATCAGATCACTACACAGA 

CTTGAGGTAATTCCACCTAAAAGCAAAGAGAACTGACCCCACATTAACTG 

AGAAGTCTTTACTTATTTATTCCCTATAAACGAGCCAATATGAAGAGAAG 

GCCTTAATGTGGTTAACTATGTAATTTTTTTCTGACTTTTTGAAATACTG 

AGAAGAGCTCATGACTCTCCCATCTCCTAATTCTACCTTGGTGGATTTTA 

G ACTGAC CACAACTCA TGGGTAA ATGAGGGAAGACGAATAAGAAACCTTG 

C'rrrrrTTTCCTCCTTGTTTTTGGCTGGCTGCAGTGGCTCACACCTGTAA 

TCTCATCACTTTGGGAGGCCAAGGTGGGAAGATCACTTGAGCTCAGGATT 

TCAAAACTGGCCTGGGCAACATAGTGAGACCCCATCTCTAAAAAAAAAAA 

AAAAAAAAAAAAAGGCGACAGGCGGTGCGTGCCTGTAATCCTACCTACTC 

AAGAAGCCGAGGTGGAAAGATCACTTGAGCATGGGAGGTCAAAGCTGCAG 

TGAACC7TGATTGCACCACTTCATTCCAGCCTGGGTGACAAAGCAGGACG 

CTGCC7CAAGAAAACAAAAACAAAACCTTAATTTTTTGGCTATTCTTTTC 

TGG7AAGAA7GG7A7AGAGA7GGGGA7GAGGA7GGC7A77G7A7GAGAGA 

GCAAACAGGG7CCAAGCAG7GC7C7GGGC7G7C7AAGGACCAG7AG7CAG 

C77AAC77C7CAAA77TCCAGGGAAGGAG77CGGAG7GG7AGAA7A7CC7 
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GGGTATGCCCAAAGCATC^CCTTGCAAATAGCCTGTCATGAATAATTTG a 

TTCATTTGTTATGACTGGAAACTGGCTTTGTGTATGCCAGAGAATGGGGG 

CAGGAAAGAGAGATTGGTGTCTTGAGCTCTCTGTGCCTCTGGGGCAGTGA 

TGCTTTTCCTCTCATGTGGAAGGAGAGCATGACTGAAAAGGTGCACAAAT 

AAGGTGTCTGTGAGAGAAATTAAC CTTCCAGATACAGAGACACAACCTTC 

CCCAAGAGGTCCTCATTGCTCTGCCTTTTTTCCTTTTTTTTGCTTGTTCT 

AQCAT7AATAACAGAAACTGATTATGACCTCAAAAGAGAGGAGAAAGCGA 

C TCTCC C CAC CCTAGAGCTAGTTAACCACCATATCTTC CTAGATATC CTT 

GAGAGCAATGTAACCC 

>Contig29 

GTGAACTCGTTTTACCTGTGTAGCAGACCAAGCCGCAGACAAAATCCNTC 

AGACAC CAAATTAAAGAAGGAAGGGCTTTATTGGGCCTGGAGCTGCGGCA 

AGACTCACGTCTCCAACAACCGAGCTCCCCGAGTGTGCAATTCCTGTCCC 

TTTTAAGGGCTCACAACTCTAAGGCGGTCCACATGAGAGAGTCGTGATAG 

ATTGAGCAAGCAGGGGGTATGTGACTGGGGGCTGCATGCACCTGTAGTTA 

GAATGGAACAGAACATGACAGGGATCTTCACAGTG C TTTTCXTATGCAAA 

TAACCGATTAGATCAGGGGTCGATCTTTACCAGGCCCAGGGTGTGTCACC 

GGGCTGTCTGCTTGTGGATTTCATTTCTGCCTTTTAGTTATTACTTCTTT 

CTTTGGAGGCAGAAATTGGGCATAAGACAATATGAGGGGTGGTCTCCTCT 

CTTACCTGCGGGGAGTGAGCTCAAACTCCTTAAAGGAGTTACCTGCCTTC 

CATCATCAGGGAAGCAGGAAATCTTGCCTTCCTTGTTGGAAGCAAGTAAA 

ACTCAAAACAAACAAAGAAAAAAACAGGGAGTTG7ACAGCAAAATAAACT 

TTTGATTTTGACCAAATTTTGGGAGATCAGGAATTCTCTGAAGGAGATGC 

TTTCAGACCTCAGCAAATTGTCCTGTTGGTTTGAGCCATAAAGTTAGCTC 

ATGCTGGTACCAAACACCAGTAGGAGATTTGTCAAAGGTAAGAGGCATCT 

CCACTCAGAATCCCTTCGTGGTTACCAACATGTGAACCTTGGAAATCTGA 

GACAGGTCTCAGTTAATTTAGAAAGTTTATTTTGCCACGGTTGAGGACAC 

CCACCCATGACAGAGCATCAGGAGGTCCTGACCACATGTGCTCAGGGTGG 

TCTGAGCACAGCTTGGTTTTACACATTTTAGGGAGACATGAGACATCAGT 

GAATATATGTAAGATGTACACTCKjITCCCTCCAGAAAGGCAGAACAACTT 

GAAGCAGGGAGGGAGCTTCCAGGTCACAGGTAGGTGAGAGACAAACAATT 

GCATTCTTCTGAGTGTCTGATTAGCCTTTCCAAAGGAGGCAATCAGATAT 

GCATTTATCACAGTGAGCAGAGGGGTGACTTTGA ATAGA ATGGGAGGCAG 

GTTTGCCCTAAGCAGTTCCCAGCTTGACTTTTCCCTTTAGCTTAGTGATT 

TGGAGGCCGCAAGATTTATTTTCCTTCTACATCACTGTGGGCAGCTGACT 

AGGAAAGCTTTGTAGGACTGGTGGGCAGTGTGAGAGCCCAGTGGGGGGTG 

GTGGTCCTGTGCCAATGGTAGCAACCACCTGTGAGGCTGAGTAAACTCAT 

rTCCCAACCTCCTCTAGCAGCCCCAGTGGAGATACAGAGGAAGCAGACTA 

GCGATACAACCCAGCCTGAAGTTTTGTCTGGTGAGTGTAATGGAATAAAA 

ATGGGAAGGGTGCTGAAGAGACCAGCAAGAAAATGGTTGAAGAGATGGGG 

CACAGAAATTAAGCTGGATCAAAAAGGACGGAAAAGCAGAAAGGGCCGAT 

AGAGAGAGGGGATATCTATGGGTTCGCGATTCTGAAAAGGACAAATCAC7 

GGTGCTTTGAGAAGAGAGAGGGTGAGAAAGCAGGAAGGCTGGAGGCTGTC 

ATCCAAGAGGCGGACATCTGTGAACATGATTCCAAGAGTCACCAGACCAT 

GGGGGTGGCCAAAGGGAGTGCCTCITCTCACCTCCTACTCTTAATTCCTT 

GTACTCAAGATAATAAGTTCCCAGAAGAGAAGTACCCATATTTAATTCAT 

CTGTGTCTTCCTAGCAGTACTAAAAATATTATATGAAAGGTATCAAACCT 

TTGAGAATGTGTGCTGCTAAATTGTTAAGGATGCTGGAAAACTCAAGACG 

TCCCTGATCCTGAGCCTGAGTATGAGCCTGTGGTGAGCCCAATGCAGGTC 

TCCATTCAGACAAAGGCCTCAGGGAACGGATGAGACCTAGGGACAGAGAT 

GCATGCTGGAGCAGCATTCCCCATCCCTACTGCAGCTCAGGCCAGCTGAC 

TGCTTTATGAGTAAACGTTACCAGGGAACACTTTGCAGTCTTAACACACA 

TGCCCACCTGTGACCACTGATCCCTGTTGGGTGACCACTGACATCAGAGA 

TTCGATGGCAGCAATGAAGACAAGGCTATCCTCATTAGGAAGGAAAGGAA 

GGAGGAGGGAGGAGGGCAAACGAATCTTTCCTGCTTGTCAACCACGTCCA 

TCTCTGTTAGG7GATTTCCCATGTGTGACTTTGTTTATCTTTATAATAAC 

TCTGAGAGGTAGGTCTTGATGTCCACATTTTGAACATGAGGACATCCAGC 

CAGGAAGTTGAGTTCTGGGGACATAGCTGAGAGGGCAAAGCTACATATAA 

\CCCCTCTTTGTTTTTTCTGGCTTATCCACTGAGTGCCCCCTGCAATCCA 

CCAGCCGATTTGTGAAGTGCATACTATAGGTAAGTTGGCACAGGAGGAGT 
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GGATGTGGGCGATTTTG ± ^CAGC7CTCCAGGAAC77ACACAC7GG7GA^' 

GAGGGCCAGG7A7GT7CC7GACCAG7CACAA7CAAAGCAACCTCC7AC7A 

ATCAGGGAGGCTTGGTACCTGGGGAATGCTATGTTGAAAGGTTCTTTTCT 

GGG7TT7AAAA7GA7GGG7CTA7T7CCT7A77C77AAGA77GC7777T77 

CTGGCTAGAACTTAAAAGAAATTTTCAGTAAAATTTCCCTTCCCTGGCAC 

AAAGTGAGCTTGAAATGAATTCCCAGGTGGCCTTGATACTT7AAAATATT 

GCCTCCTATAAAATCAACCTTTAGAAGAAGGAAGTCAAAGAACATGCTAG 

ATTTCACAAAGGTTAATTCCTTGAAATCCAGTTATCTACAGGACAATGTT 

GTCAAAGAAAAAATTATTTGGCCAGGCACGGCGGCTCATGCCTATAATCC 

CAGCAC7TTGGGAGGC7GAGGCAGG7GA7CACC7GAGG7CAGGAG7TCGA 

GACCAGCCTGGCCAACA7GGTGAAACCCCA7C7C7AC7AAAAA7ACAAAA 

AAAA7TAGCCAGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACACGGGA 

GGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGAGGAAGTTGCAGTGAG 

CCAAGT7CAAGCCACTGCACCCCAGCCTGGGCAACAGAGCAAGACTTTGT 

CTCCAAAAAAAAAAAAAATTCAATGATATTTTTAAATTCATGGTAAGGAA 

GA7T7CA77CAGAACCAGCACAGAAGATATAGGAAACAC7GCAA7GGGAC 

TTTGCGGTGGGGGAGAGAGATTGAACACAACTACATATACAGCACGGGCA 

AGGACATATTCATAGCCAGGAAGCAGAGCAAAGATCAGTGGATGCGAAAT 

7AC7AAGAGGAAACA7GAAAAA7AAGGGAGC77C7GCC7AAACCCACCTA 

ACCGGA7CC7TGCTGAAGACAGGACAGGG7GA77GGACACCAC7T7GGGG 

ATGGTGGAGGATGGGGAATCCAGTGAGATTTCAAGGGTGATGCGATATTG 

AACATACAAAGTTCTTGCTAAAAAAGGATT7TACAAGAAAG7GTACAAAT 

G7GCG7GGGACAAGG7GCAGGAGCCCGACGGAGA7G7GG7CCAGCAGAGA 

A7A7G7GCCGAGA7GA7AGG7GAGT7CTC7GACGAAGGATA7A7GC7GA7 

CCAGCCAGGG7GAAA7GC7CAGAGAAAGCACGGAGGGGCTA7G7CCG7TG 

CCCCAG7C7CCACGCGG7CAAATCTGA7CCCG77G7GAG7G7GGCCG7T7 

G7AGAAAGCAA7CAGGGGGGG7CCC7CCCC 

>Concig30 

AATA7A7A777T7TATANNA7NTGAGACAGGT7C7CAC7AGG77GCCCAG 
GCTGG7C77GAA77CC7GCCTTCAAG7GACTCTCCCACCTTAGCCTACTG 
CATAGCTGGGATTACAGGCACAAACCAC7GCA7GCAGCTAAC7T7GCTTC 
7CA7TCCAGCACTTTT7A7TCCAC7GA77A7A7G7A7A7G7A7A7C7GCA 
7CA7CTC7CTC7C7CTC7CTCTCTC7CTC7CTCTATATATA7ATATATA7 
A7GGAAA7A7C7C7CTCTCTCTCTA7A7A7ATA7ATGGAAA7ATA7ATC7 
CAG7C7C7CC7A7CCTCCTTTAA7CAGTT7TGCTATCC7G7CAA77CCCC 
CAACGAG7G7GA7G7TG7GAAATATA7A77TG7TCT7CATC7CC7G7TTC 
C7GACA7ACAGC7777AAAAACCCTTGGAA7C7C7GGAATAA7AAGAG7G 
TC7T77GCA7GC7AATAGATGAC7GC7GGC7GGCAGCCCCAA7GCAG7AG 
C77CA7GA7GGGG77TG7CACAGGAAAGACCAAGGCAGGA7TGGAGACT7 
GAGAC7G77AGCCCCAC7CCCCAACCACTGGAGGGAGTGGAGGGGC7GAA 
GG7TG7G7CAG7CACCAATGGCCAATGGT7CGGTCAATCA7G7GTA7GTA 
A7AAAGC CAC7CT7AAAAACCCAAAAAGGACAGGGT7TGGAAGGGC7CCC 
AGA7AGC7GGACACATGAAGGTTCCTGGAGGG7GGTGCCCCAGAGGGGCA 
TGGAAGC7CCACACCCCTTCTCACATGCTTTGCTCTGCGCATCTCTTCAT 
C7GGTG77CA7CTGTATCCTTTGTAATATCTTTTAGAA7AAACTGG7AAA 
C77AAG7G7TTTCCTGAGTTCTGTGAGCTGCTC7AGCAAATTCACGGAAC 
CCGAGGGAAGCAAACCCAGATTTATAGCCA7CAG7CAGAAGCA7AGGTGA 
CAACC7ACCAC7TGTAACTGGCACCTGAAG7GGGAGGCAGTC7TGTGAGA 
C7GAGCCC7CAACCTGTGGGATCTAACGCTAACTCCAGGTAGATAG7GT7 
GGAG7GAA77AGGACACCCAACTGGTGTCGGCTGCTGGAGGACTAGTGG7 
GGGAGAAA7CCCCAAGCATTTCGG7GACTAGAGGTCACAGAAGAACTCAG 
7G7TGAGG7G77G7GACAGTATGG7AGGGAAAAC7GCG7CTGGTTTTTTC 
C77TTACAA7CAG7TAAATATTTAACACAAGTCTACTGTATATTAG7AAA 
AGGG7TACATTTT77AA7GTCTTGACAG7TGCAC777GACAACT7CGA7A 
TCAATCACT7TTT77CG7G7CCG7TTGGAACCAAAA7CACT7GGGA7ACC 
A7GAACCAGGC7GCAGCG7ATTCCCCAGGCC77GAAAGC77GGAGGCCAT 
777GCCAGCCOTAA7CCC7GTGAA7ACCAGGCT7CG7GGATT7AAAAAAT 
AGAC77GAGGCCAGGCC7GGTGGC7CACACCTG7AAGCCCAGCAC77TGG 
GAGGCAGAGGCGGA7AGA7CACAAGG77AGGAG7TCGAGACCAGCG7GGC 
CAACA7GG7GAAACCCGG7CTCTACTAAA7A7ACAAAAAAAAAT7AGCCG 
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GGCGTGATGTTACACGwJAGTAGTGCCAGATACTCAGGAGGCTGAGGLaG 

3AGAAA7AC7TGAACC7GGGAGGCAGAGG7TGAAA7GAG7CAAGA7CG7G 

CCAC7GCAC7CCAGCT7GGGCGACAGAG7GAGAC7CAG7777CAGGGGAG 

TTAAAACAATACAAAAAAAGAAAAAGACTTGAACAATGAGGCTCCACTGG 

A7GGA777AGGGGAA77ACAGGAAGCAGGACC7GACGG7GCAA7GCCACA 

CTCCACCTG7CCAGAATTGGACCTCACCAAGGGAGGTCTGTGGGGACAGG 

GAGAGGCCC7C7GCC7CCACCCCC7CCTC7AC7CCCCAAACCCTGAG7CA 

G5C7GAA7G7AG7AAACC7GGAACAGAAAAG77CAG777GGCAA7AGG7A 

TCTGAAGGACTCCAGGTGCTTCTCCCTTGATTCAAAATTTTACTTATAAA 

AAAAA77A7AAGAAAA77CTACT7AAAAGAAA7AA7CAGGGAGG7ACAAC 

AAATTGTACT T I IT 1TTTTTI 1111 11111 1 ' 1 T GAAATGGAGTCTCACTG 

TTGCCCATGCTGGAGTACAGTAGTGTGATCTCGGCTCACTGCAACCTCCG 

CCTCCTAGGTTCAAGTGATTTTCCTACTTCAGCCTCCCAAGTAGCTGCGA 

TTACAGGTGTGTGCCACCACACCCGGCTAATTTTTGTATTTTTGGTAGAG 

ACGGGGTTTCACCATGTTAACCAAGATGGTCTCGAACTCCTGACCTCAGG 

TGACCCACCTGCCTCAGACTCCCAAAGTGTTGGGATTACAGGGGTGAGCC 

AC7AAGCCCAGCCATTCTACATA7TT7GTGGG7A7T7AC7AAAACATTA7 

TCAAAATAGTAAAAAAAAAT7GAAATAAACTGGGGACTGGTTAAATAATT 

TTGGGTACAACCACATGATGGAATACTATACAGCCATTAAAAATTACATT 

GAGGCCAGG7GTGGTGGCTCATGCTTGTAATCTTAGCACTTTGGGAGGCC 

AAAG7GGGAGGA7TGCTIGGACCCAGGAGC7CAAGACCAGC77GGGCAA7 

GTGGCAAAACCCTGTCTCTAAAAAAAAAATACAAAAAAAATTAAAAAGCT 

GGG7G7GGAGGCACACACC7C7AG7CCCAGC7AC7CAAAGGGC7AAGG7G 

GGAAGA7CAC77GAACCGGGGAGGTCAAGGC7GCAG7GACCCAAAA7CGG 

G7CA77GCAC7CCAGCC7GGGCAACAAAGCAAGACCC7G7C7CAAAAAAA 

AAAAAAA7ACA77GAAGAA7A7C77ACGG7A7GGA7AAATA77CAT777A 

CAG7GA7AGA7GCAAATAAAAGCAAA77ACAAAA7A7ACAG7TTAA77CC 

AAC777GA7AC7ACA7A7G7ATA7A7GAA7ACA7GCA7A7G77A7G7A7G 

7ATATG7AAATATAACAATATATG77CTA7A7A7GGATAT7A7A7A77TA 

CACA7ACATACACACATATATAATA7CTTC7C7AGAGAGCAGAAAGAGAG 

7AGACAGA7AA7GAAGA7AGGA7ACAACTCCAG7CCAGC7CAACC7AGGG 

GAC7TGT77TAAAGCCTCAGGAGAGAGAAG7TGGGAC7AGAAAGCAAGGC 

AGCTAT77 G7AAG CATCTTTGTG777CATGCTATTGGGGTGGGAAACAAC 

AGCACAAC7TT7GAAAGCCCC77TC7ACTCACCCCACAAACTGCAGAGCA 

GC7TTAGGACCC7CAGAGTTCAAGAAGACCA7TTGCAGAG7AGAAGAAG7 

AAAAACA7G7A7GAAC7TGACCC7GAGCTCATGGAC7G7GCCA7GAGGGA 

AA7TCC7AAAACAGCAGGAGAGGCCC7GGAGGAAGGCAGAGGCCCTGCA7 

-AGCAAG7CCAGGCAAAAGCCTGCAT7CCA7AGATGC7CATC7C7C7GGC 

TGG7GAGG7C7AAAGACGTT7GG7C7CAA7A77AAG7C7CG7GAGAGAGG 

7CACAAACCCAGTCCCTTGGCCACAAAAGGAAA7AAA77C7GGC77GAGA 

CA77AGGGAGGAACAGGGCAAGGGGAGGT7CAAGAAAG77TTAATGGATG 

AGA7GA7A777AAGCAAGGCCC7GGAAAA7GAGAA777CAACCAA7AGCC 

A7A7GG7AGG7CAGAAAGCAAAGA7AAGGAGGGGGCAAG7GCAAGGGGCA 

ACA7GAGA7A7GACCAGGGTGTCG7GGGGCATGGCTGA7GGAGAAGAAGA 

77AGAC7GGAG7TTGGGAATGCCACAGTATCGAGGTTGGA777AA7CCTA 

7GGGTAA7AAAGCCAACTGTTCAACCCCCAACCCACTTGCAATATGGCTC 

CAAAA7AGCAGGTGTTTGATAAAATGACTACTTT7ACTCTACTATTCCCT 

CCC7C77AAGAAGAAAAAGAAAGTGGAGGC7CAGAGAAAGGCAG7GGC77 

G7CCCAA7CACACTATGATTTGGCCACAAAACAAGAACGAAATGTTACAC 

CCAAAAA7GCTGCCTCCACCTCCCTrC C TT GCr TTCCTCCC7GCTGGAC7 

ACAGAC7ATCTCAAGAGTGACGTACACCATCAGGGCTTCAGCTTTTCCCC 

GAAACAA7GCCAAAATATTAGCCA7ACGTCACTGTAGTAAGAGCCC7GAA 

77GGGAA7CCCAGCTTTGACGCAGACATGCTGAT7GAC7C7G7GACCATT 

CTC7TCAC7TC7CCACTCTAT7CTTCCCCACC7GTAAAG7GAGGTCCTT7 

CCAG77A7AAAAACAGATGATGC7A77GTCCTG7T7TG7ATCTAATCTTG 

C7G7G77A7AAAAAAAAAATAAGGC7CTGTACA77CA7C7TGGCCAATTC 

CCT7CTTA7C7C7ACTTCCCACAGCCCCT7T77C7ACAGAAAACCAGCA7 

7G77C77C7GGA7CCATCTCT7AAGAAAGCGC777GCC7CCCCGG77A7T 

7AGG7GA7AAGAAGTGTCCTAGA7GACAGCCC7GGAA7GGGC7GGAGGCA 

ACAAAAAAGCAAG7GAAATAGACAG77ACAGCGACGACAATAA7AACAAC 
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CAACACCTCTCACTAAAGAGAAAGAAATAAAAAAGAAAATTAAAATCTGC 
C GCAATGC CCACACAGTCATTGAATAACTGCATGTGTACAGCACTTGGTT 
ACTTTTACATACTTCATATTTTAGCCTTCATAGCAGCTCACAGGGGTGGA 
TTTAATTTTTAGTCCAACTCCTGTCACGGTGCCTGGCACAAGTATAATAA 
ATGTTCTGTGAATAAATGACCCTCTTTTTAGATGAGGAAATCGAGGCTCA 
AGGAGAACAAGCAATGTAATGTCCCCCTCCTGTTCAGCCATCTGCCTTTC 
ACGCCACTGAATGCAGTAGTCCTCAGTGCCCTGAACTTGACCCTCTTCTG 
CTTTTCGGACTGGTCCTTCTAATCCCGTTGTGACTCACTACACCACCTCT 
CCTGCATATGACATCTACATTTTAAAACAAACCGTATGGAAATAACACAT 
TAGTCGGCTTGTTCCCCCACCCCCGCAAAAAAAAAGGCCTCTTTATAACA 
GAAACTTCTCAGGCTGGTAGGGGAATTTTATTCCCCCATTTATGGTAGAA 
AGGC C Z TAAC CTTGGAC CT CACG C CATAGCTATT CACATGGGGGAATGAT 

GAATAACATGGGGAGCAGCATGTAAATATCATTGAGCCGTAGTCCAGACC 
TATAACACATC 
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GGGGGAGC7GCATGTGCCTGTCGAGATCTGGGGGAGGAACA6GAAGATCA 

AGAGTTCTGTGTAGGACATGTTAAGTTGAAGGTGCTTACAGGATAGCCAG 

ATGAAGCATCAGGTGTGCAGTCAAAGATATGAGTCTGGAGCAGCACATCC 

TAAGTCACCTCCTGCACCAACACAGAACTTCCAGGCCACTCACTTGAGCT 

CTCCCAAATAGTTTCCAAGTGTCATTATGTTAATAACCTATGAGCTTGAA 

CACCAGATTCAAACCCCACTGCATGGCTTTTAAAGACCATCTCAAGGGCT 

TGACACrCCAGGGAGCCAACTAAAGATGCCTGGTCCTACCATCAACCTCC 

ACCCGATTTTTTATAGAAAATGTTTCTACCTGTCCTAAGGCAGGGTCCTG 

CCCCACTCCCAGGCCCCTTTAGATCCCCAATATTCCTCCTCCCTGAACCA 

AAACCCT-CATCATCTTCCAGCATGGGTGGGGCCTCCATTCTTGCTTCTGC 

TCCC CTGAGCAGAAGCAAGTTTCTCCCAACTTGACCTGATTCTCCTCCTA 

AGTACCAGTCACTGCTTTGTTTCTGGAATGAGAGAAAAAGACAGAGTGAG 

AGAGACAATCCAGAACTCTTGCTCACTCACAGCTAGGCTGGGCATCTGGG 

AGGATGGCTGTGTCCATGGGAACCTGGGAAAAGCCACACCCTTGGCACCC 

TGGTCACCCACCTGTCTCCCTGGCAGATTCCGCACTGCTCTCTTGCACCC 

TCTACCAGGGCTAACCGGCCTGCTCACTCTCCCCAGCATGTCTTCCCACG 

CCCACTCTCTAATTATTACATTCCCTTCACATAAACTGCCCTTCTCTCCC 

AATCACCACATGTTCACTTCCCACCCAGCTGTCAAAGTCTGGCTCAACCT 

CATTCTTGAAAAGGAAAAAACAAACAAACAAACAAACAAACAAGCAAAAA 

ACCTATGATGGATTAAGAACACACTTCATTCCAGGAACATGCTTATCTCC 

TCTAACTCTCACAACAACTACAGCAGGTAGGTGTTATCACACCCATCTCT 

CAGGTGAGAAAACAGGCTCAACGAGTGCAGGAGGACACAGCAAGTCAGTG 

ACAAAGCTTAAATTCAAGCCCAAGCCTGTTGGCAACCAACGTCTGTACCC 

TTGATAGCTACCTCATTTACCACCAAATCCAGTGGCCTCAGGCCTGGCTG 

CACACTGGGATCACCTGGTGCCCAGACCACATCTTAGACCAGTCATACAG 

AATCTCTTGGGCTGGGATCCTCCACGGTACATTTTAAGGGTCCCCAGGTG 

AGTTCCACCATGGACCCAGAATTGAGGACCCAATACCGTATACCATCTCC 

TTCTTCATCTCTTCTAAGGCATCTCTTACTCGCTGTGCACTCCCATACCA 

CTTTGTTCAATCATCCAATCATTCATTCATTGAGTCAGTTAGTCAGGAGC 

TACTCACTAGTCCCCTGCCAGGTCCTAGTCAXGACATAGGGCTCTGGGGA 

CCAACAAGAAGCAGGACCCATGCCTCCTGCTCTCATGGAGCTTGCTCTGC 

AGCAGAGGAAGCAGTCAGTGAGATGTAGCAAATGTGAAATGTGCACAGAT 

GGGAAAAGCAAAACTTTAAAACTTTTAGGACAAAATACACAAGAAATCTT 

TGCAACTTTGGGACAGGAAGGAACAACATTCCTTACACATGACACCAAAG 

GAATCAACCATAAATAAAAAGGTGATCAATTTGACCTCATTTAAGTGTTA 

AGCTTTTTTCATTGAGAGACACCATTAAAAATTAAAAATACATGCCACAA 

ACTGGGATACAATATTTACAACACTTATGTCTCACAAAGGATTAGTTTTC 

AGAATATATAAAGAACTCCCGGCCGGGTATGGCCGCGCACGCTGGAATCT 

CAGCACTT7GGGAGGCCAGCGGATCACATGAGGTCAGGAGTTCAAGACCA 

GCCTGGCCAACATGGCAAAACTCCGTCTCTACTAAAAATACAAAAATTAG 

CCAGGCATGGTGGCGGGCGCCTGTAATCCCAGCTACTCAGGAAACTGAGG 

CAGGAGAATCACTTGAGCCCAGAAAACAGAAGTTGCAGTGAGCTGAGCTC 

ACATCACTGTAAGCCTCGGTGACAGAGTAAGACTGTCAAAAAAACGAAAA 

CAAAAACAAAAACTCCTACAAATAAATAAGAAAAAAATAGCCCAGCAGGA 

AAAAGTATATACATTTCATAAAAGAATAAATACATTCTGTCAGTTTTCTA 
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ACA7A7A77777TAAGAGTAAA7ACAAA7GG7TAGGAAACA7TT7TrAAA 
A7GCCCAACC7CA77AAAAATTA7AGAAG7GAAAA77AAGCCACAA7AAG 
ATACGATTTTATACCAAATACAGTGTCAACACTTTGCAAGTCTGACCTCA 
CCAAG7G77ACCAGACGTGTGCAC7GACG7GGC7GC7GAGA7AC7GA7GG 
7GGG7C T 37AAA7CTGTACTACAAACAATTGCAA7AAAATG7AA7AAATA 
7ACAA7AGG7GGAGCAGGAAGTGACCTGCAACCA7A7AGCAGA7AGGGCA 
GGAAAAAGCC7A7GAAAGCTGACA7CAAAGGGA7AAG77CCAGTTACCCA 
G CTGAAGGG AAGGAGGGTGTTTCAGATAGAGGAAGGATAAG CATGAC CTA 
77CAAGGCCAG7GAAAGAAGCGTGCAACGGCCAAG7CAGGAGAACCTGAA 
ATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACTATTGGGGGT 
TTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAAGGTCACTGG 
CACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGTCTGTTTTGG 
AAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGGTCACAAAAC 
TTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTGTGGACTGTG 
GCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAAGGCATGTGG 
GAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGGGTTAGAGTG 

GGTAAGACAT7CATGGACTAGGA7CGAGGAANGAGG7GGGGAA7GGGACC 

ATACCTGCAGTTTATAAGGGGTGGACGAGGGAAGATTATGCGGGAGACTG 

AGAGAGGAATAGACAAAGGAATCCCGGTGCAGTATTACAGAAACTGGGGT 

GGGAGGGGG77G7ANTTCAAAAAGGAAAGAAAA77G7CAAA7AG7A7GAA 

ATGCTGCAGAGAAACTCACGGATTTTTTTTTTAAGCTTAGAATTATTCAT 

7GAC7A7G7GAA7AAGAATAAC77T7A7GAAAGAAG7TT7GC77AAG7AG 

7AGGAAGAAGCAAAA7TGTTGAGGGC7GA7GAG7GGGAGGAGAAGTAA77 

GAAGGCAC7C7T7CAAGAGAAACAAAGCAGAAGG7GAGGAGAA7AC7AAT 

GAAGGAGT7ACGGCCTTCACTATTTTGTTTTGCTTTAGATAAGCAAGACT 

7GAG7GGG7C7GG7GAGGAGAAACAAG7AGAG7ACAAAGT7AAAGGAGAG 

ACAGACAGAGATAGAGATAGGGACAGAGAGAGAGACAGAGACAGAGCACA 

AAAGAGCAAGGTCCCTGAGAACACGGGCCTTCTGTTTAAACCCCAGCCAG 

ATGTATTGCAATTCAATTCCAGTACTAACCACCCAGAGTTTGTGTAGACT 

CTACAAGTTAAAGAGCATGGTCCCCAACAAGACTGCTTCTACGTCAGATG 

CCAGGCACACTTCAGGGGTCCCCAAGCCACTCATGTTTTTTGAATGACTG 

CCATAAGTTCAAAAATTCCCACAATTCTCTCAGATTCAATAACTGGGTAT 

AACCACTCATAGAACTCAAGAAAATGCTATCATTATTATTACAATTTTAT 

TATAAAGGATACAAATCAGAAGGACTAGCCAAATGAGGAGACACATAGAG 

AGAGGACTAG7AAAAAACAGAGCTTCTGCGTCCTACCTTCAAGGAATCAG 

GATGCACCACCCTCCCAGCACATCAAGTGCTCATCAACCAGGAAGTTCCT 

C7GAGC7C CAA7G7CCAGAGATTT7AGGGAGGATTCA77ACA7AGG7A7 C 

ATTGA7TAAATCATTGGCCATGTACTTGAACTCAATCTCCAGTGTCCCTC 

77C7CCC7AGAGG7CTGAAGGGTTGGC7AATA7CATG7GGC7CAAAGCCC 

CAAC7C7AAT7ACC7TTTTGGTCTTTrCAGGGAC7AGACCCCA7CCTGAA 

GCTA7C7ACAGGCCCTGCCATGAGTTAGCTCATTAACATAACAAAGACAC 

7TA7A77AC7CAGAAAATTCCAACAGTTTTAGAAGCTCCATG7CAGGAAC 

CTGGGAGA7AGA7CAAATT CriTriTTlUUUl ' lTlUTriXJ GAGACAGGGT 

C77GC7G7G77GCCCAGGCTAGAG7GCAAGGACAGA7CACAGC7CAA7GC 

AGCTTCAACTTCCCAGGCTTAAGTGACCTTTCCACCTTAACCrrCCAAGT 

ATC7GGGACCACAGAAAATGGCTAATTA7CCTGGCTGATTT7TAAACTTT 

7TTTrr77G7AGGGATGGGATCGCCCTG7G7TGCCAAGG7TGG7CTCAAA 

CTCC7GGG77CAAGCAATCATTCTGCCC7GGCCTCTGTGATGGTTAATAC 

7GAG7G7CAAC7TGATTGGATTGAAGGATACAAAGTAT7ATTT7TGGGTG 

7G7C7G7GAGGG7GTTGCCAAAGGAGAT7ACA7TTGAG7CAG7GGACTGG 

GAAAG7CCACCCTTTCCCAGTGGACTGGGAGACCCACCC7CAA7CCAGGT 

AAACACAA7C7AA7CAGCTGCCAG7G7GGTCAGAATAAAAGGAGGCAGAA 

GAACAGGGAAACACTAGACTGGCT7AGTCTTCCAGCCTACATCTTTCTCT 

CATGC7GAA7GCTTCCTACCCTCGAACATCAGCCTCCAAGTTCT7CAGTT 

7TTGGAC7C77GGACCTTCAACCACAGATTGAAGAC7GCAG7G7TGGCTT 

CCC7G7T7T7GAGGTTTTGGGACTCAGAC7GGCT7CCTTGC7CC7CAGCT 

7GCAGA7GGCCAA7TGTGGGAC7T7AAC77G7GA7CA7G7GAG7CAA7A7 

7CC77AA7AAAC7CAGA7ATATATATATG7A7CAGACA7ATATATATA7C 

C7A77G7A7A77A7ATACAGATA7A7AA7ATCC7A77ATATACAGATATA 


FIG* 3 (24 of 52) 


WO 99/06426 


PCT/US98/16102 


rAA7A7CCTATTATATAuAGGTATATATATATATATG7ATCATATA7AXA 
7A7CC7A77GG77C7A7CCC7C77GAGAA7CC7GAC7AA7ACAGCC7CCC 
AAAA7GC7GAGA77ACAGGAG7GAGCCACAGCCACCA7GCCCAGCCCCAA 
A77C77AA77A7ACAACAA7GGG7CCAGAGA7CAGGGCC7GGG7AGGA7G 
CAGCAATAAGAAAACAGATGGTGGA7GGGGACACA7G77GGAAG7G7GGC 
AGGACA7GGC7GAGGGAAC7CA7AGGA7GG7G7C7A7777CA7GGC7GAG 
7G7GAGGAACAGCA7AAGG7CAAAA777CAGG7CAA7GG7GAG77T T T7A 
AA77G77GC 7G7G AACCCCAAAAA7C7GACCCAGG7C7CAG77AA777AG 
AAAGTC7A77777CCAAGG77GAGAACACCCACCCAC7CACGACAAGAGC 
A7CAGGAGG7CC7GACCACA7G7GCCCAAGG7GG7AAGAGCACAGC77GG 
7777A7A7A7777AGGGAGACG7AAG7CA7CAA7CAA7A7A7G7AAGA7G 
TACAC7GG77C7GCC7AGAAAGGCAGGACAAC77GAAGCAGGGAGGGGGC 
T7CCA7G7CACAGG7AGG7GAGAGACAAACAG77GCA77C777GAG777C 
TGA77A7CC777CCAAAGGAGGCAA7CAGA7G7GCAA77A7C7CAG7GAG 
CAGAGGGA7GAC777GAA7AGAAAGACAGGCAGG777GCCC7AAGAAG77 
CCCAGC77GAC77777CC777AGC777G7GA777GGAGGCGCCAAGA777 
A7777CC777CACA777CCCCCCC777C77777AAGAA7C7777AAAGAA 
AGC7777AAAAAGAAAA7GAG7C7C7GG7CCCAGG777CA7C7GAA77C7 

CGAGGGGAGGA7GG7TTATCCTAAACGGG7GG7TCTGAAT7TTGAGAAAG 
7GCA77G7AC 

>Concig32 

AAAAGCCA7ACGAA7GAGGAAGAA77AAGGGCCAGAACAAAACAAGAAGA 
rGAGGGAAAG777GGAAC77C77AGAGAC7GGC7AAA7GG77G7GACCAA 
AA7GC7GA7AG7GA7ACGGACAA7GAAG7CCAGGG7GACAAAG7C7CAGA 
TGGAAA7GGGGAA777G77GGGAAC7GGGCAAAGG7CACCC77GC7A7GA 
C7CAGCAAAGAAA77GGG7GCA77G7G77CA7G7CC7GGGGA7C7G7GGA 
AG777GAA7G7AAGAG7GA7GAC77ACGG7AGGG7A7C7AG7GGAAGAAA 
CC7C7AAGCAACAAAG7G7G77GC77AGAAA777C777C777C7777777 
777777777GAGC7GGAG7777GC7G7G7CGCCCAGGC7GGAGCGCAG7G 
GCGCAA7C77GGC7CAC77CAAGC7C7G7C7CC7GGG77CA7GCCA77C7 
CC7GCC7CAGCC7CCCAAG7AGC7GGGAC7ACAGGCGCC7GCCACCA7AC 
C7GGC7AA777777AG7A7777AG7AGAGACGAGG777CACCA7G77AGC 
CAAGA7GG7C7CAA7C77C7GACC7CG7GA7CCACCCGCC77GGCC7CCC 
AAAA7GC7GGGG77ACAAGCA7GAGCCACCCCGCC7GGCC7GC77AGAAA 
777C7AAGCCAGGA7A7GGCC7G7C7GC77C7AACAGCC7G7GC7CAGGG 
G7AAAGAAA7GAC77AAAG77GGAACC7A7G777AAAA7GGAAG7AGAG7 
C7AAAAA777GGAAAA777GCAGCC7GGCC77G7GGCAGAGAAAGAA7CC 
AAG7AGGC7GCAGAGCAATCATTGCTAGAGAGATTAGCATGAC7AAAAGG 
GAGCCAAG7GC7AA7A77CAAGACAA7G77AAAAAGGCC77GAGGGCA77 
7CAGAGA7C7A7GAAGCAGCCCC7CCCA7CACAGG7GCAGAGG777GG7G 
CAC7AGGCCCAGAGG7777A7GGGCCANNGCCAGGGCCACAC7GC7A7GC 
ACAGC777GGGACAC7GC7GCCCGCA7CCAGGCCAC7C7GC7C7GGC7CC 
ACCC77GGC7CAAACGGGCCAAGA7AGAGC77GGACCAC7GC7CCCGAGG 
GCACAAGCCA7AAGCC77GG7GG777CCA7G7GG7G77AAGCC7GCAGG7 
GCCCAGAA7GCAAGA77GAGGGAGC77GGGCAC77CCACC7AAA777CAG 
AGGA7G7G7CAGAAACCC7AGG77CCCAGGCAGAAGCA7GA7ACAGGGGC 
AGAGCC G77GCAGAGAACC7C7AC7AGGGCAA7GCCAAAGGAAAA7G7GG 
GG77GGAG7CC7CACACA7GG7CCCO^C7GGGGCAC7ACC7GG7GA7AC7 
G7GGGAA7GGGGC7GC7GCCC7CCAGACCCCAGAA7GG7AGA7GCAC7GG 
CAGC7GGCACCC7GAGCC7GGAAAAGC7GCAGGCAC7CAAC7CCAACCCA 
7GAGATCAGCCACATGGGCTACTCCCAGGGAAGCCCACAGAGGCAGGGCT 
G7C7AAGGCC77GGGAGCC7ACCCC77GAACCAGC77GCAGGACA7GGAA 
TCAAAGA77A7G77GCAGC777AAGGC77AA7G7777CCC7G7CAA777C 
AGGC77GTGTGGGACCTGTTG C TTTTTTTTTTTTTTr fTT TTTTTTTGGT 
CACAGG7G777GAACCAGAACAA77CCA7C77GAA7AGGGGC7GGG7AAA 
A7AAGGC7GAGACC7AC7GAGC7GCA77CC7AGGAGG77AGGAA77C7AA 
G7CACAGGAGGAGA7AGGAGG7CGGCACAAGA7ACAGG7AGCGAAGACC7 
CGC7GA7AAAA7AAG77GCAG7AAAGAAGCCAGCCAAAAC7CACAAAGCC 
AAAA7GG7GA7A7GG777GGC7C7A7G7CCCCACCCAAA7C7CA7C7CAA 
A77A7AA77CCCA7AA7CCCCACA7G77GAGGGGAGGACC7GG77GGAGG 
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7GATT0GATTA7GGAGGCAA777CCCCCA7GC7G77C7GG7GA7AC7GAG 

7GAG77CT CA7AAGA7C7AA7GG7777A7AAG7G777GGAAG77CCT CC7 

ACACACATGC7CACAC7CTC7CC7GCAGC777A7GAAGAAGG7AC77GC7 

77CC777C7GCCA7GAT7G7AAG777CC7GAGGC77CCCAGC7ATGCAGA 

AC7G7GAG7CAA77AAACCCG7777C777ATACA77ACCAG7C77GGGCA 

377C777ACAGCAG7G7GAGAAC7GCTGGCGA7GAGAG7GACC7C7GG77 

GZPCTCACTGCTCATTATATGCTAATTATAATG7ATTAGCATGCCAAAAG 

ACACTCCCACCATGACCCCAACAGTCATGCCTGTGCCGGTCTCAGCACCA 

7GACAG77TACAGA7GGCA7AGCAACG7C7AAAAGG7ACCCCA7A7GGAC 

TAACAAGGGGAGGAACCCTCAGCTCTGGGAAGTGCCTACCTCGTTCCCAG 

AAAGC7TG7GAA7AATCCAC7GC77G777AACA7A7AA77AAGAAA7AAC 

TA77AAGCA7CC77AG77CAGCAGCCCAAGC7GC7G77C7GCC7A7GGAG 

7AGCCAT7C777A77CCG77AC777C77AA7AAAA7TGC7777AC7TTAC 

TGTATGTACTCGCCTGGAATTCTTTCTTGTACGAGGTCCAGAGCCCTCTC 

TTGGG7C7GGA7CGGGACCCC777C7GG7AACAT777GACCAA7T7C7CC 

C77CTGGAATGGGAATG7TTACACAA7GAC7G7A7CAC7777GAA7C77G 

GAAGTAAATAATTTGTTTTTGACTTTACAGCCTCATAGGTGGAAGGAACT 

TGACTTGAATTTCAGATGAGACTTTGGACTTTGGGACTTTTGGGTTGGGG 

C7GGAATGAG77AAAAG7TGGGGGGA7TA77GGGAAGGCACGA7777A77 

77GCAATA7GAGAAGCACATGAGAT77GGGGGACCAAGGGTGGAA7AA7A 

TGGTTTGGATGTTTGCCCCCTCCAAATCTCACATTGAAATGTAATCCCCA 

G7G7TGAAG7GAGGCC7GC7GGAAAA7G777GGA77ACAAGGC7G7CGAG 

CACAT7GGATAAGACG7G7AGGNCCC 

>Conric33 

CGCAGCTCGC7GG77AA77C7G7GGC7CC7G7GACCAC7A77A7AGCACC 

AGG7CTATGACCAGGAGAATTAGACTGGCATTAAATCAGAATAAGAGATT 

T7GCACC7G TAATAGACCTTATGACAC CT AAC CAAC C C CA7TAT7TACAA 

T7AAACAGGAACAGAGGGAA7AC777A7CCAACTCACACAAGC7GC77TC 

CTCCCAGATCCATGCTTTTTTGCGTTTATTATTTTTTAGAGATGGGGGCT 

TCACTATGTTGCCCACACTGGACTAAAACTCTGGGCCTCAAGTGATTGTC 

CTGCCTCAGCCTCCTGAATAGCTGGGACTACAGGGGCATGCCATCACACC 

TAGTTCATTTCCTCTATTTAAAATATACATGGCTTAAACTCCAACTGGGA 

ACCCAAAACATTCATTTGCTAAGAGTCTGGTGTTCTACCACCTGAACTAG 

GCTGGCCACAGGAATTATAAAAGCTGAGAAATTCTTTAATAATAGTAACC 

AGGCAACACCATTGAAGGCTCATATGTAAAAATCCATGCCTTCCTTTCTC 

CCAATCTCCATTCCCAAACTTAGCCACTGGCTTCTGGCTGAGGCCTTACG 

CAT AC C7 C C C GGGGCTTGCACACAC C77C77C7ACAGAAGACACAC C7TG 

GGCA7A7CC7ACAGAAGACCAGGC77CTC7CTGG7CC77GG7AGAGGGC7 

AC7T7AC7G7AACAGGGCCAGGG7GGAGAA77C7CTCC7GAAGC7CCATC 

CCC7C7A7AGGAAA7G7G77GACAATA77CAGAAGAG7AGGAGGA7CAAG 

AC77C777G7GC7CAAA7ACCAC7G77C7C77C7C7ACCC7GCCC7AACC 

AGGAGC77G7CACCCCAAAC7C7GAGG7GAT77A7GCCrrAATCAAGCAA 

AC77CCC7C77CAGAAAAGA7GGC7CA7777CCCTCAAAAG7TGCCAGGA 

GC7GCCAAG7A77C7GCCAA77CACCC7GGAGCACAATCAACAAA77CAG 

CCAGAACACAAC7ACAGC7AC7A77AGAACTAT7AT7AT7AA7AAA77CC 

TC7CCAAA7C7AGCCCC7TGAC7TCGGA7rrCACGA77TC7CCCT7CC7C 

C7AGAAAC77GA7AAG7T7CCCGCGCT7CCCTT777C7AAGAC7ACATG7 

T7G7CA7C77A7AAAGCAAAGGGG7GAATAAA7GAACCAAA7CAATAACT 

-C7GGAATA7C7GCAAACAACAA7AA7ATCAGC7A7GCCATC7T7CAC7A 

7T7TAGCCAG7A7CGAG77GAATGAACA7AGAAAAA7ACAAAAC7GAA7T 

CT7CCC7G7AAAT7CCCCGTTTTGACGACGCACTTG7AGCCACGTAGCCA 

CGCC7AC77AAGACAA77ACAAAAGGCGAAGAAGAC7GAC7CAGGC77AA 

GC7GCCAGCCAGAGAGGGAG7CA777CA7TG GCG77 7GAG7CA GCAAAGG 

TA77G7CC7CACA7C7C7GGC7A77AAAG7A7777C7G77G77G77777C 

7CIT7GGC7G7777C7C7CACAT7GCC77C7C7AAAGCTACAGCC7CTC^ 

777C7777C77G7CCC7CCC7GG777GG7A7GTGACC7AGAA77ACAG7C 

AGA777CAGAAAATGA77C7C7CAT777GC7GA7AAGGAC7GA77CG777 

TAC7GAGGGACGGCAGAAC7AG777CC7A7GAGGGCA7GGG7GAA7ACAA 

ITGAGGC77C7CA7GGGAGGGAA7C7C7AC7A7CCAAAA77A77AGGAGA 

AAA7TGAAAA777CCAAC7C7G7CTC7C7C77ACC7C7G7G7AAGGCAAA 
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7ACC77A77C77G7GG7G*'rT777G7AACC7C77CAAAC777CA77GA77G 

AATGCZTGTTCTGGCAATACATTAGGTTGGGCACATAAGGAATACCAACA 

TAAATAAAACATTCTAAAAGAAGTTTACGATCTAATAAAGGAGACAGGTA 

CATAGCAAACTAATTCAAAGGAGCTAGAAGATGGAGAAAATGCTGAATGT 

GGACTAAGTCATTCAACAAAGTTTTCAGGAAGCACAAAGAGGAGGGGCTC 

C Z -7CACAGA7A7C7GGA77AGAGGC7GGC7GAGC7GA7GG7GGC7GG7G 

77C7 C7 377GCAAAAG7CAAGA7GGC CAAAGTTC CAGACA7G777GAAGA 

CC7GAAGAAC7G77ACAGG7AAGGAA7AAGA777A7C7C77G7GA777AA 

7GAGGG777CAAGGC7CACCAAAA7CCAGC7AGGCA7AACAG7GGCCAGC 

A7GGGGGCAGGCCGGCAGAGG77G7AAAGA7G7G7AC7AG7CC7GAAG7C 

AGAGCAGGTTCAGAGAAGACCCAGAAAAAC7AAGCATTCAGCATG7TAAA 

C7GAGA77ACA77GGCAGGGAGACCGCCA7777AGAAAAA77A777TTGA 

GG7C7GC7GAGCCC7ACA7GAA7A7CAGCA7CAAC77AGACACAGCC7C7 

G77GAGA7CACA7GCCC7GA7A7AAGAA7GGG7777AC7GG7CCA77C7C 

AGGAAAAC77GA7C7CA77CAGGAACAGGAAA7GGC7CCACAGCAAGC7G 

GGCA7G7GAAC7CACA7A7GCAGGCAAA7C7CAC7CAGA7GTAGAAGAAA 

GG7AAA7GAACACAAAGA7AAAAT7ACGGAACA7A77AAAC7AACA7GA7 

G777CCA77A7C7G7AG7AAA7AC7AACACAAAC7AGGC7G7CAAAA777 

7GCC7GGA7A7777AC7AAG7A7AAA77A7GAAA7C7G7777AG7GAA7A 

CA7GAAAG7AA7G7G7AACA7A7AA7C7A777GG77AAAA7AAAAAGGAA 

G7GC77CAAAACC777C77T7C7C7AAAGGAGC77AACA77C77CCC7GA 

AC77CAA77AAAGC7C77CAA777G77AGCCAAG7CCAA777T7ACAGA7 

AAAGCACAGG7AAAGC7CAAAGCC7G7C77GA7GAC7AC7AA77CCAGA7 

7AG7AAGA7A7GAA77AC7C7ACC7A7G7G7A7G7G7AGAAG7CC77AAA 

777CAAAGA7GACAG7AA7GGCCA7G7G7A7G7G7G7GACCCACAAC7A7 

CA7GG7CA77AAAG7ACA7TGGCCAGAGACCACAC7GAAA7AACAACAA7 

7ACA77C7CA7CA7C77A7T77GACAG7GAAAA7GAAGAAGACAG77CC7 

CCA77GA7CA7C7G7C7C7GAA7CAGG7AAGCAAA7GAC7G7AA77C7CA 

7GGGAC7GC7A77C77ACACAG7GG777C77CA7CCAAAGAGAACAGCAA 

7GAC77GAA7C77AAATACTTT7G7777ACCC7CAC7AGAGGTCCAGAGA 

CC7G7C7TrCA77A7AAG7GAGACCAGC7GCC7C7C7AAAC7AA7AGT7G 

A7G7GCA77GGC77C7CCCAGAACAGAGCAGAAC7A7CCCAAA7CCCTGA 

GAAC7GGAG7C7CC7GGGGCAGGC77CA7CAGGA7G77AG77A7GCCA7C 

C7GAGAAAGGCCCCGCAGGCCGCT7CACCAGG7G7C7G7C7CC7AA7G7G 

A7G7G77G7GG77G7C77C7C7GACACCAGCA7CAGAGG77AGAGAAAG7 

C7CCAAACA7GAAGC7GAGAGAGAGGAAGCAAGCCAG77GAAAG7GAGAA 

G7C7ACAGCCAC7CA7CAA7C7G7G77A77G7G777GGAGACCACAAA7A 

3ACAC7ATAAG7AC7GCC7AG7A7G7C77CAG7AC7GGC777AAAAGC7G 

7CCC7AAAGGAG7A777C7AAAA7A7777GAGCA77G77AAGCAGA7777 

TAACC7CC7GAGAGGGAAC7AA77GGAAAGC7ACCAC7CAC7ACAA7CA7 

7G77AACC7A7T7AGTTACAACATC7CA77777GAGCA7GCAAA7AAA7G 

AAAAA7C77CC7AAAAAAATCA7C77777A7CC7GGAAGGAGGAAGGAAG 

G7GAGACAAAAGGGAGAGAGGGAGGGAAGCC7AA7GAAACACCAG77ACC 

7AAGACCAGAA7GGAGA7CTTCCTCACTACC7C7G77GAATACAGCACC7 

AC7GAAAGAAC777CA77CCC7GACCA7GAACAGCC7C7CAGC77C7G77 

T7CCT7CC7CACAGAAA7CCT7C7A7CA7G7AAGN7A7GGCCCAC7CCA7 

GAAGGC7GCA7GGA7CAA7CTG7G7C7C7GAG7A7C7C7GAAACC7C7AA 

AACA7CCAAGC77ACCTTCAAGGAGAGCA7GG7GG7AG7AGCAACCAACG 

GGAAGG77C7GAAGAAGAGACGGT7GAG777AAGCCAA7CCA7CAC7GA7 

GA7GACC7GGAGGCCA7CGCCAATGAC7CAGAGGAAGG7AAGGGG7CAAG 

CACAA7AA7A7C7T7CTT77ACAG7777AAGCAAG7AGGGACAG7AGAA7 

77AGGGGAAAA77AAACG7GGAG7CAGAA7AACAAGAAGACAACCAAGCA 

77AG7C7GG7AAC7A7ACAGAGGAAAAT7AA77777A7CCTTCTCCAGGA 

GGGAGAAA7GAGCAG7GGCCTGAA7CGAGAA7AC77GC7CACAGCCA77A 

777C77AGCCA7A77G7AAAGG7CG7G7GAC7777AGCC7TTCAGGAGAA 

AGCAG7AA7AAGACCAC77ACGAGC7A7G77CC7C7CA7AC7AAC7A7GC 

C7CC77GG7CA7G77ACA7AA7C7777CG7GA77CAG777CC7C7AC7G7 

AAAA7GGAGA7AA7CAGAA7CCCCCAC7CA77GGA77G77G7AAAGA7TA 

AGAG7C7CAGGC777ACAGAC7GAGC7AGC7GGGCCC7CC7GAC7G77A7 

.^AAGA77AAA7GAG7CAACA7CCCC7AAC77C7GGAC7AGAA7AA7G7C7 
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GG7ACAAAG 7AAGCACC^-A A7AAA7GT7AGC7A77AC7A7CA77ATTA;*. T 
A7TA7777A77777777T7T7GAGA7GGAG7C7CAC7C7G77GCCCAGGC 
7GGAG7GCAG7GGCGCAA7C77GGC7CAC7GCAAGC7C7GCC7CC7GGG7 
7CACGCCA7777CC7GCC7CAGCC7CCCGAG7AGC7GGGACAACAGGCA7 
G7G C CAC CATGC C CAGCTAATTTTTTTGTATTTTTAGTAGAGATGGGGTT 
TCACTGTGTTAGCCAGGATGGTCTCTATTTCCTGATCTCATGATCCGCCT 
GCCTTGGCCrCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCG 
G<?C7A77A77A77A77ATTAC7AC7AC7AC7ACC7A7A7GAA7AC7ACCA 
GCAA7AC7AA777A77AA7GAC7GGATTA7G7C7AAACC7CACAAGAATC 
C7ACC77C7CA7777ACA7AAAAGGAAAC7AAGC7CA77GAGA7AGGTAA 
ACTGCCCAATGGCATACATCTGTAAGTGGGAGAGCCTCAAATCTAATTCA 
GTTCTACCTGAGTAAAAAAATCATGGTTTCTCCTCCATCCCTTTACTGTA 
CAAG C CTC CACA7GAAC7A7AAAC C C AATATTC CTGTTTTTAAGATAATA 
CCTAAGCAATAACGCATGTTCACCTAGAAGGTTTTAAAATGTAACACAAT 
A7AAGAAAA7AAAAATCAC7CATATCGTCAGTGAGAGT7TACTAC7GCCA 
GCACTATGGTATGTTTCCTTAAAATCTTTGCTATACACATACCTACATGT 
GAACAAATATGTCTAACATCAAGACCACACTATTTACAACTTTATATCCA 
GCTTTTCTGACTTAGCAATGTATTGATGACATTATGCATGCT7AGACCTC 
C 

>Concig34 

G7A77C7A77C7CGG77A7AACACAA7CACAG7GA777G7CA7A7C777C 

CAGGATTTGTTAATTTCACTTCTTCAGCTGTTTCCCCCTTGTTGGCTGGA 

AC7G A7777 C7A7 C77 C7GGGAGAA7C77CAGCAAGC CAAC7 CAGGATTT 

GTTGGGTGCATTTTGTCAAGTCTAGGACCCAGGCTCTGGGTGACTGATTT 

C C7 C7AA77 AC CGAGCAA7G7AAAA7GAGGAAG7C7GA77G7G7AAAGG7 

GTTAAACTTTTGTGTGACGGCAAAACTTTAATACCATGAATAGAGATTCC 

AGAATTTTCCAACTTCTAACGGGATTCCTTTCACTCCCtGACATTAGAAT 

GTTAGAAAATCTACCACAAAACATCTGTGAGGCTATCCTACAAGGCCCGT 

TTTTCAAAATAGGTTTTTACAAGGATTGCTATTTGGGATGATAGTTTCAG 

AAAGGCGCTATCAAAGTTAATTGATGATGTGTGCAAGCTGAAAGTTATAT 

GTTAGAACTAGCAGTGATTTCAAAAATATCCCTTTTAGGCTTTTTGCTAA 

TATATCTGCTCATTTTCAAAGTTCCCAATATTATAAAACTTTTTAAAGCA 

GAAAGAAGAAGCCTCCATTTCTGCTGGCCCCTTCCCTGTTCAACTAAAAA 

GTATTTTCCCAGGCAATGCTATCCCAGGACTCACACTCCATCCATCCATC 

ACCTACCATAAGTTCTTTGAAGGGCTCATTCTGAGCGCTTCCTGAGTGCC 

TGGGATCTGTTATTTCTCTCCATTTCTGCTGCTGCATGGTAGTCCAAGTC 

C7CC7CCC7777CCCC7AGGCCA7T7GAA7CA7C7GC7AA77GG777TCC 

7GA77GCCACGGAAAC77CC7CCA7CCC77CC7CACATA7CAGCCACAGA 

AG7A7C7CCAAAAAGCAAA7C7GG7GACA7GAAGCCCT7GCACAAAACCC 

A7TCA77AC7GG77CCACACC7CC7TTG7GGATAAG77CAAGCTCC7GAG 

7G7GGCAAGCAGGGCCCACC7GGAATCCCC7GCCC7CC7C7CCTATCCCA 

CGCA7CAA7C777CG7GTCTAT7TGCAGTrCCTTGAATGTGA7A7TCTTT 

C7AG7C7C7G7GC7777GCA7AACC7G77C7TCC7GAC7GGAAAC7CC77 

C7CC7CC77GTAGTT7GGCTAATTTCTAGTCTTTCAAGACTCAGCTCATG 

CTTCACCCCC7CTA7AA CAAG TCCTTTCCqAGCTGGGTGG7GGATGCTC 

C7CTG7GC7G7GTGAGTCTTGAACATCC7CAGCAAACCTaVGCTTTGTT7 

GCT7G7C7CCC7TGCTGTCAATGCACCTGA7TCAGGGCTGGCATATACTG 

77CACC7CCA7GAC7GGCTCATGGTGGTGCTCCGTGAATATCATCCACCC 

AAACGGA7GAGAGC7ACCATGCCATCAC7TGTGACT7CCATCTGGAGCTA 

ACC7CCCCCGACAGGAAAGCGTTTCC7TAGGAAAGAATATCTTTGGGTTA 

AATAGAAG7AGAGAC7CACCAGAAGCAC7A7GTCCAGCTCAGAA7GAAC7 

GC7CAG7AAGCAGCCT7G7CAA7GAGGAGGCAGCAGGCCAGCCCCAGAGG 

CC7CAAAG7GGGAGAG7AGAGAAGCGCAG77CC7GCCACAAAGGCACAG7 

GGACACC77GC7CCCC7GGCTGGCTGGAAGCAGATGGTG7CCACCTGCTT 

CCA7GGG AA77C7GCACCTTTAATAAAGTTTTA7GGGACAGGAAGGTGAC 

7GGCA77GACA77G7AACGAGGAA7GGG7GG7GCCACCT7TGC7G7GTCT 

7ACCAGAAA7ACC7GTGGCAGG7AAATTTC7AGAGAGACCC7CCCATTTC 

7CCCA7A7AGCAATT77GAAA7GTTTCCTGAGGGC7TTCCAAAT7CATC7 

GGGAACA7AGGAG77CCAGAAAGATGAAA7CAAAGG7GA7GG7A7GCCAA 

AGAAAG7AGC7777AGAA7GAC77ACA77AGCCA77CA7CCA77CAGCAC 
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ACCAGGCATTCAGTTTOAGGGGTGTGTGTGTGTGTGTGCGCGCGCGCGxG' 

CGTGCATGAGTGCATGCGCGCGCGTGTACATAGGGGAAGGGAAACAAAAC 

AAAAGTACACAAGACATGATAGTTGTCCTCAAGGAGTTTTTGCAAATGTT . 

CACAATTTAAGAGAATATGCTGTGCTGTGGCTGGTGTATAAACCAACTGC 

TAGGGAGAGGCCTTCCACACACACTTGGGGCAAATGCGACC7CTAGGACT 

GCCAGTGGAATCTGGGCATGCTGTTTGTGGTCGATAAACCCTGGTCCCTT 

GATCAGGGACCTATG7TTACTTTTCCTCTCCCTGGAAGTCTTCATTAGTG 

GGtATCCAGAAGGTCTTGCACAGGGCAGAGGGAGGCACAAAGACAAGAGT 

TTGAAACCAGCCTGGACAACAAAATGAGTTTCTATCTTTACAAAAAAAAT 

TTTTAAAAAATTAGCCAGGTAGGATTGCATGTGCCTGTAGTCCCAGCTAT 

TCAGGAAGCTGAGGCAGGAGGATTCCCTGAGACCAGGAATTTTGAGGCTG 

CAGTGAGCTATTAAGTTGGCGCAAAAGTAATCGTGGTTTTTATCATTAAA 

AGTAATGGCAAAACTTTTAATGAO^AAAACCGTGATTACTTTTGCACCAA 

TTTAATATGATTGCACGACTGCACTGTGCTCCAGCCTGGGCAACAGAGTG 

GGACCCTGTCACAAAATAATAAATAAATAAAATGTAAACATGTAAAAAAA 

ACCCCAAAAACAAAAAAAATGGGTGTTGAGACCCCTGAATTGAGGAATAA 

TAGGAAGGAGTGTGATTCTGTGTGTGCATGCATGGGTGTGCACCCTCAGT 

GCCTGGGTGGCTTACCCTGGGCTAGTTCAGGTGGCAAATGG7TTTCCTCC 

AGCTGGGCTACCACCATCTTCCCCCAGGGCCTGTCCATGTATTTGGTGGC 

AAGATACCTATGGACTAGAGTCCCTCCTCAGAGGAAAGGCTCCTCCCATT 

TCTCTGGCTTTCAGGTAGTAGTCCATGACTTCAACAGGTCCCCAGTGCAA 

7GTTATGGGTTAGTTTAGGTGGGGTCTCCTCTGAGAGCCTCCCATAGCCC 

AAAAGGCCCTGTCCTAGCTGGCACTGCATCTCCCTCTTCCCAGCTCTCAG 

CCTTTCTCTTTGCTCATCCCACTCCGCACAGGCTTTCTGCCTGATCCTTG 

GATGTGTCAATCCTGCCCCTAAGGGATGCAAGGCAATTTGTCCTTTTATT 

ATTAAGATCTCTCCTGAGGCCACGTGTGGTGGCTCACACCTGTAGTCCTA 

GAACTTTGGTAGGCCAAGGTAGGAGAATTGCTTGAGCTCAGGAGTTCCAG 

GCTGTAG7GAA CCAT GATT GCACCATT GCATTCCAGCCTGTGTGACACAG 

CGAGACCCTGTCTTTTTTCTTTTTTTTTTTGAGACAGGGTCTCGCTCTCT 

CATCCAGGCTAGAGTGCAGCGGTGTTTTTCTGCTCACTGCAGCCTCAACC 

TGCACATTTTTT G TAGAGACGGTGTCTTGCTATGTTGCCCAGAGTGGCCT 

CAAACTCCTGGGCTCAAGAGATCTTTCCACCTCAGCCTTCCAAAGTGCTG 

GGACTACAGGCGTGAGCTACCGCGCCCAACAAAGACCCTGTCTTAAAAAG 

AAAACAAAAATAAACAACTCCCTCAAGT C rrTTTT riTT TTTTGAGACGG 

AGTCTCGCTCTGTCGCCCAGGCTGGAGGGCAGTGGCGCAATCTTGGCTCA 

CTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCTTGCCTCAGCCTCCC 

3AGTAGCTGGGACTACAGGTGCCCGCCACCACGCCTGGCTAATATTTTGT 

ATTTTTAGTAGAGATGGGGTTTCACTGCGTTAGCCAGGATGGTCTTGATC 

TCCTCACCTTGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGAT7AC 

AGGCATGAGCCACCGCGCCCAGCCAGACCTCTTGAGTC7TAAACTCC7CT 

GTAGTTCCAGCCACCCTTTAGCACATGACTCTGTTAATTTTGTTCTCACT 

GTCTGAAATCATCTCCTGTCCACTCTTGACTGACAGGTCTCTGCACTAGC 

CCACTGCTTAATCAGAGTAGGTCCCTGTCAACTTATTCATATTGTGTCCC 

CATGCCAGTGTGGATGATTAAAATTGTTGAGTGGAGGCTGATCAGATGAG 

CCATCTCCTTCCAAGTCCTCACTTGCTGGCTCCTGTCTTAGTTTTAGTCC 

CCATTCTTCAAAGAACGTGAGCCCTGGAAAGTATTTTAGTaVTTTAGTTC 

AGTG CCTTTGGATGGGAGGATCACATCCCTGGGTCCCGTCCTGCAGAC7G 

TTTTGCTCTAGCTGACTAGGCAGGATTCCCTGCCTTCTCTCACTTCGGCA 

TGGGACTTCCTTCTGAAATTGCTGCTCAGTCAAGAGAATGACCTTCCCCA 

ACATAATCCTACTCCACAGGGACTTAAAGGTGTGTCAGAGATCTCTTGCT 

CATCTTTCTGGCCAGGTGCCAACGTCAGTTTATAGCCAAGGGACAAGACT 

AGTTAGCAGATCAGGCAGGTCTTAGACCCCAGCGTAAGTGCCAGACTTCT 

AGCTGCAG7TGTTCCTGCCCACACTGGGCGTTCAGGTGGAGAGAGGGCAT 

GGCACTACACTGAGCTCTCGGCGAAACCCAGGACTCTGAAATCTCGGTGT 

CAGCCACAGGCCACTCTTTTCAGCAGGACTTCAGTCAGTCC7GTCACTAG 

GCTGTCGAGCACATGGTAGGCTTTACCCC 

>Concig3 5 

AAGGAGTGTGCTTGCTGATAGCATGTGTGANGGGACGAGGAGTAAATAAT 
TTCTGCCTTCAAGAAATTGCAAACTAGTAATGGAGATAAAATCAACAGAG 
3AACAA7TAGAGTATAAGG7AAAATCTAAGGGCCATAAGAGAGGAGAAGA 
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AG7 A7GGGAG77 CAGAGu rAGGGGG7AAA7GAGGGGAG7AGG7GGG7AUA 
AAAGG77AAAAG7AAA7AA7GA7GGGAAGGAAGACAAAAAGACGACAGGG 
GTGCCAAAGGACTCTTAACCTCATC7GAACGGAG7TGCCCTGTTTTGCTC 
TC7GA7GC7CA7G7A7C7A7CC77AGAGACAGC77GGCGGGCAA7G7AGA 
GCGTAGGGGCTGACATAGGGGGTTGGAGTC C CAC CTCCGTGACTTCTAGC 
AAATTAGCAAACTTTGCTGCTGCTAAGCCTATAAGGCGGACAGAAATGCC 
A7C777AAAGC77GT7A7G7AAAG7GCC7AGGACC7CG7AGGCA7CAACA 
GGAATAATGGATGAAACAAAACAACGGTGCGTATCTTGGAGAAAGTGGCA 
TCTGAGCAGGAGTATTTTGAAAGGTAGGAAAGGGCTCCAAGCACATCTAA 
GAGATTAGGGAACGCAGAAGCCTTAGCCCTGGGTGCAGATTTAACCAATC 
AAC7TC7AACCACCGCAGGC7GAGAGG7G7GGAG7GAGAGCCCCGCCAGA 
3GCAGGAGACCCGGGC7TCGGCCAGACCCCGCC7CC7GGTACAGAGGACC 
ACGCCCGGCTCTGCCTGGAGCCAAATGTGGATCAAAACAGCGCGCAGCTT 
CCCACTGCTGGTGAAAACCCGAGCAAGGGGCCTCAGTTTCTTTATCCGGA 
ACGTGGTGACAATGACATCTCTTTGCAAGGCTGCTGCAGGGCTTTCTGGA 
AATACGCCCGTGAGGTATCTGGGCCTGCGCACAGCCTCCCCCGCCCAGGA 
CCCAGACGTCTACCTGGGGGTCCCGTCTGCGCTCCCGGGATGGAAAACGC 
CGAGGGGAAAC77AGGCAGGCGAGCGGACGGGCACC7CCCGCGGGACGAA 
CTCACTCGGTGGCCTCCTAOTCCCCGGCCGTGTTC^CGCCTGAGAAT 
AACGGGAACAGCGGTCGTACTCACCGACAGCGGCAGCAGCGGTAGGCCCG 
GGCCCCACCATGACTCTTCAGTGACAGTTTTTCTTCAAACGCCGCSCCTG 
TAGCCAGGACCGGCGTGCCGCGCGTCCACGCGTCCTCATTGGCTCCTGCG 
GGTTTGAAACTCGCTAGTCG7CAGCACGGGAGGGCGGGACAACAGGCAAT 
AGGC7C777GCGG77GGC7CTGGCC77GAGAACCCGACC77GGGGCCC77 
TGATTGGAAGAACGTGCAGCGCACCTCGGCATTGAGGGCGGCTTCCTCGG 
GGCGCGGCGCCGCCCGCCTCTGAGTGCGCCTGTGAGTGCGCCTCCGAGTG 
GGCGTGGGACCCTCCGTGGGGGCCTCAGCCGGGCTGGTGGTTGGGGGGCG 
GTTACGCTGAATCCAGCTGGGGTTGGCGCGCCGGGAGTCCCTGGGCGGAG 
AGACAGGGCGGTCCTCCCAGGATGCTGGGGCCGCTACCTGATTCTGTCCT 
TTCAAAGTCTCAGACTCACAGGAGCTGTGAAAAAATAATATTATAAAGAG 
GACATATGGGTCTTATGCATCTAAAGGCTCCTAGTTCTTAGTACTGCAGG 
GTGGCT CGTTT AATTGTGGTAAAATATGCATAACATCACATATACCATTT 
TAACCATTTTAAAGTGTTAAATTTTTCAAAAATGTGCAGTTTAGTGGTAT 
TAAGTACCCTCACATTGTGGCACAGCCACCACTACTGTCCTTTCCAGAAC 
TTTTTCATCTTCCCAAATGAAACCCTGTACCCGTCACTAACTCCGCACTC 
CTCCCTCCCCCAGCCCCAGGCAATCACCATTCTAGTTTCTGTCTCTATGG 
ATTTGACAACTG7AGGTGCCATATAAGTAGAATCATGCAGTATTTGTTCT 
37GAC7GGC77G77TCACT7AGCA7AAAG7A77CAAGG77CA7CCA7G7G 
7AGCA7G7G7CAGAAT77CC777CC7777AAGGGGGAA7AGCA777CG77 
G7G7GGAGA7GCCACAT777GC77C77GG7CCA7CCC7C7CCGGACAC77 
GAG77GC77CCACTTTT7GGCTA7TG7GAA7AA7AA7A7GAACA7GAA7G 
CACAAA7AAC7C7TTGAGAC7CTCC7777CA77C77T7GGGTA7A7ACCA 
CGAAG7GG7A77GTrGGATCAAACGGCAA77C7A77777AAT7TTrrGAG 

A777CACC7GA7CAGC7GACTA7AAGGCCATAAGGC7AACGGAGAAACGC 
AGGCC7AG777C7CCTAGTTACTAGGAGATCGCAGGCC7CGT7GTCCTGA 
A7CCC7AGACACACTTCAT7CCCCT7G77T7AA7CC7AAATTTTT77TCT 
T77GAAG 777G7 CCTGT7TCATCTA77C7CCAG777C77AAAGAGG7C7G 
GAAAA7GC777TGGC7CCT7GTG7A7GAAGG77CC7CTTCCATGGATGCT 
GGAGAAG7CG7G7GTGGAGGGGCAG7CATATCTGGGCACC7GTTGGCCAG 
377CAGC77ACCAGTTGGGTAC7CAGCAGGGCA7GAAGCCACTGCAGCAG 
CCC77C7C777AGCCGTAAA7AGGGAG777GGAAGAGAGCCAGGG77TC7 
GGA777A7GCA77TTGA7A7TTrCAA7AG7G7A77AAATG7TTAAAA7AG 
3AAAAC7GA7CA7TA7T7T7GTTAA7GACTGAGAAAGGGACTCCTTCACC 
AACAG777CAGAAAAGTGAAGGCGG7777GT77TGG7C7TTG7AGAA7CT 
AGG7GG77GAA7GCA7G7CAG77G7AGAAGTCACC77GCC7GA7A7CCCA 
CGCAG7GC7GGAG7ATTCCACAGACCCCA7G7AGG7AC7GCACC777GCA 
GG7A7AC7GC7GG7G7TGG7GAGC7GCC77ACC7G7CC7G77A77GGAGA 
Z CCC7GC77A77A GGAAA CTTAAAA7GAAC7CAAA7GAGC7TCC77GCrr 
AC7GG7CC7AG7CCT7TGGAGCAACA7AGGCCAG77CTGCC7CG777777 


FIG. 3 (30 of 52) 

W//r 


WO 99/06426 


PCT/US98/16102 


TCCATCCT . - ^G7A777GACGG7C7A7777G7AGGACACAAAA7G7GGG 

AAAATAGCTAGGCAGGTTTAAAAATTCTCAACTCTACCAAGCATC3G7GGC 

"T7A7G7C7G7AA7CAA7CCCAGCAC777G7GAAGC7GAGGCAAGAGGA77 

GC77GAGCC7AGGAGT7TGAGACCAGAC7GGGCAACATAGCAAGACCTCG 

777 C77AAAAAAAAAAAAAAAA77ACAAAAA77AAC CAGG CATGGTGGCA 

CACACCTC-TAGTCCCTTCTACTCAGGAGGCTGAGGTGGGAGGATCACTTG 

A5CC7AAAAG77GAAGGA7GCAG7GCAC7G7GG7CA7GCCACCGCAC7CC 

AGCA7GGGAGGCAGAGCAAGACCC7G7C7CCAAA7AAA7ACA7AAA7TAA 

A77C77AAC7CA77CA7CAAAG7A7CCAC7G7AGC777CCA7CA7CCTGG 

7G77G77777777AGAAGGA7C7GGC7CCA77GCCCGGC7AGAG7GCAG7 

GGCATGATCTCAGCTCACTGCAGCCCCCACCTCTCTGGCTTAAGCGATCA 

CCCACTTCAGTCACCCATCTGGGTAATTTTTG7ATTTTTTGTAGAGATGG 

GG7777GCCA7G77GCCCCAGGT7GG7C77GAAC7CC7GGC7CAAGCGA7 

CCA7C7GCC7CCA7C7CC7AAAGTGTTGGGAT7ACAGGTG7GAGCCACCA 

CACCAGGACAA7CC7GGTGGCTT7TAACGG777TCCA77GCTC7CAGGC7 

AA7GACC7A7AAGCCCC7GCGGGC77GGCC7777AC7CCC7SAGCA7TAG 

CCACC7CCC77AGCCT7AGCCCACAC7AC7C7CCCC77GC7CAG7G7TA7 

CCAGACAC777G777777CC777CCA7AC7CC7C7C7G7C7GGGAA7CCA 

ACCT77C777CTCATTTCTCTAGTTGATTATTAT7ATTTTTACTCTAGCA 

GCCT7AT7GAGATATTTACATACCGTACGATTCTCCCACTTACAG7GTAC 

AA77CAA7777C7AACA7777CA7CACCCCC7AAAGAAACCC7ATAC7CA 

77AGCAG7CAC7CCCCA77C7CCCC7CC7C7CAGCCCC7AGAAACCA7GA 

A7C7AC7A7CCA7C7C7A7AGA777GCC77C7GGACA777CA7A7G7A7G 

AAA77A7GCAA777G7GG7C7C7GA7GGGC77C7777G77ACCAAAA7A7 

CA7GGG777GA7C7AGG7CC7GC7GC7CGC7GCACAGAAAGCCAGCCAC7 

GAGA7GACAAG7A77GCCAAGGAAGAAGGC777AG7CAGG7GC7GCAGC7 

GAGGAGA7GGGGGC7CAA7C7CAAATCCA7C7CGC7GACC7AAAACCAGG 

GG777GGA7AGCAGGGAAGAAA7G7AACAA7GCG7AAGAAAACAGGAACC 

AGGGAGGGGCAAGGAAGCAA7CC7GA7GAA7GAG7GG7CCAAAG7C7CA7 

7GCC7GGA7G7GG7GA7C7GGCGAG777CAG77C777GA7AC7T7T7TTG 

AGAGGCC7GAAG7C7777CCCCAGGAAGGAAC7CAAACAAAACAAA7ACA 

AGC77CCAGC777AAGACCAGAAGCG7CAA777C7A7G777A7CCGAAAG 

AACAG7C7A7GGGAC7A77GG77AAG777CAC777CAC77AG7A7GC7G7 

777CAAGG777A7CCACA7AGCA7G7G7CAG7AC77CA77CTIT7A7GAC 

7GGG7A77C7A77G7GCGGATA7ACAA7A7777A777GCCA77CATCAG7 

7GA7GGACA7C7AGG77C777CCAC77777GGC7A77ATGAA7AA7GC7G 

T7A7GAAC7772A7G7A7AAG77777G7G7AGACA7A7G7777CAACAC7 

ZA7GGG7A7A7ACC7AA7GAGAGGAA77AC7G7G7CA7ACGA7AA77C7A 

7G777AACCA777GAGGAAC7GCCAGAC7G7777CCAAAGCAGC7GCAGC 

A7777ACA77CC7ACCAGCAG7G7A7GAAAG77CCAG777C7T7ACA7CC 

7CAACAACAC77G77A77G7CCA7C7777AAA77ACAACCA7CC7AG7GG 

77G7GAAA7GG7A7CACATTGTGGTTTTTA7TTG7A7TTCCTTGATGAC7 

AA7GA7G77AAGCA7C77777A7G7G777AC7GGCCA777G7A7A7C7C7 

A77CAGAG7C777GCCAATTTTTAAA77GGGTCAGTTG7C7TCTTCCTT7 

777777GAGA7GGAGCC7CACTCTGTT7CCCAGC7GGAATACAGTGGTG7 

GA7C7CAGC7CAC7GCAACTTCCACCTCC7GTG77CAAGTGATTCTGG7G 

CC7CAGCC7CCCAAG7AGCTGGGAT7ACACGCACC7GCCACCATTCCCAG 

C7AA7777777C777G7A7T77GAG7AGAGACGGGG777CACCA7G77GG 

CCAGGC7AG7C7C777G77GAC7C77AACCA7CC77CAG7C7CAGACAAA 

ACA7CCC777C7CAAGGA77G7GA77AGC77GA77A777GC77A7C777C 

7CCC7GC7AG7C7G7AAACTGAGGG7AGGCCAC7A7A77CA7TG7TC77G 

GCACCAAA7AGAAAC7AAATTAA7G7C7777GAA7GAA7AGGGC777C7C 

C7TT7AAAGA7CCCT7CAATACAG7AACCACAC7A7A7ATAAG7AGCCAC 

AAGCCCA77CAA7AA7AC7ACTAG7NC77GCGCCAAACC 

>Concig36 

GGC7CAGCG77AC7A7AC7GGTC7CAAAC7CC7GGGC7CAAGCGA7C7GC 
CCCCC7CGGC77CCCAAAGTGT7GGGAT7A7AGGCG7GAGCCACGG7GCC 
7GGCC7CAAA7AAC7ATTTAAG7GAAACAAAAC7AG7ATGGCAC7AA7GA 
AAAA7G7A7AAA7CCA7AATCGCAGAGGGATT7CAAC77AC77C777CGA 
77A7G7AAAGG7CAAACAGACAAAAGACAA7GACAAAACT7AA7GCAA7G 
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AACACTTTTGATTrAATGAACATATATTGGATATGTACCCAAGAATTAGA 

3AATACATACTAGTTTTGAGTTTATGCAGAACATTTACAAAAATTTAGTG 

3AAGCCTAAATTATAAAAAGTTGCTGTCACG7AGAATAACACACAAACCC 

3TGAGTCCGGAATTCAAAGCCCTCCACACTCTCCTCTACCTTTGCATCTT 

TATCCTCCACCACACTGCAGTGCATACTCTGGGCTACTACTCACTGTTCr 

TGATTCAAATTCCATGTTCTGTCAGCTCAAATCATTCTCTCTGCCTGGAA 

TA^CTACTTCATACATATTCTGCTATTGAATTCTTGTCTTAGCACCCCAT 

CTACTCCAAGACGATGTCCAGTTGGGGTTACTCCCTGTCCCATTTTCTTT 

GATTACACTTTTTTTTTCTACTTCCATTATATTATTGATCACATCTGTGC 

CACAGTTTTTGACTTTGTGTCTGCTTTTACTCTTTTCTAGACCCTGATAG 

CTCCTGAAGGGTTGGGTCATTTCTTTTTTATTTGCTCATTCCTCATGGCA 

CAGTGAGTGCTTAATAAATGGCTATTGACTGAAATTAAACTGTATCTAAA 

TGGACATATTCCACTTCTGGGCCATTCATTCTTTCTTTCTATTGGAACCA 

GGAGATGGGGAACCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACAT 

GGAACCTTCCCCTGAGGGCCAAAAAAGAGCAGGGAAAGGTGCAAAGACAA 

AATCTTCCATTTTTAAACAATGTAAGAATGTGGTCCACCTCAXGCTCAGG 

TGGGACTTTATCATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTA 

CCCCATATACA7TTACCTTTAGTGTAGGGAACTGAGGACAGGAATTT7GT 

TGATGCAGACTCTTGCTAATGAGGCTAACACTTGGAGAA' I TTTTATCATG 

CATTCAAGAAGCTTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGT 

TTAGCTTTAGTTGTAGGCTTATCAGATATTTGGAGATATCTTCATAAACG 

ATGGCTTTGGTTTTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATA 

ATCAAACAGCATGGCCATTTGTT7TGTAAGGCCTTTCCTAGAATATGACG 

GTAAAATCTACGTGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGT 

GAATCTAGTTTGTCTTCAAAATGAAATCAAGTGATTAAAATGTAGTTT7C 

TAAGAAGATAAATGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATC 

ACTCATCCCTCATAAAACTGTCCCAACTGATCCTGACTCACATGAATGAA 

TTAAAATAAGAGTTAATAACATCAATTTACATTTTTAAAGACACTTTCCC 

ATGTTTTAG ACTAT TGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAG 

AGTTGGCTGTTTTTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCT 

AATTTTGTTGCAGAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAG 

TGTGGAGAAGTACGGAGCCATTTACTGAAAGGCTGGGGGGAAATGACGAG 

ACCCTGAGATAAGGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTT 

GAGATATGTTCAGAGTAGAATCAGAATGGACATAGTGAACAACTGGATGC 

AGGTGGGGGCTGAGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGT 

TGATCCACTGAAGTTACATTATTCAACACCACAAGGAAACTAGGGGAATG 

AGAAGGCATACTGGTTTGCTTTGGAG7GGAAGGGCAGTGATGTAAGAGGA 

3TTAATGAGTTAAAGTTTGGATATGCCTGAACTTCAATTTGATATG7GCA 

7CTGATATACCCTTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTAT 

TTCTTAGTAACTGATAGGCATCACAGACTCACATCAGTAAGGAAGCAACA 

3CAAACTTGATTGGACGATATACCTGGAACTCAGTACCCTATGACTGGAG 

CAAGTCTCTGTCAGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAA 

TATGTTGTTAGGAATATATGTGATGAACAACATAGGATACTTCCTACAGG 

GCTCCACATGTAGTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTG 

TAATTTATTTCCAAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAA 

TTAATAACAAATAGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCA 

CTTTGGAAGGCCAAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACC 

AGCCTGGGTGACACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAAT 

TAACCAGATG7GGTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTG 

AGGTGGGAGGATTGCTTGAGCCCATGAGGTTGAGGCTGCAGTGAGCCATA 

ATTGTGCCACCACACTCCAGACTGGGTGACAGAG7GAGACCCTATCTCAA 

ATAAATAAATAAATAAATAAATAAATAAGTACAAACCAGCAAACACTAAT 

CCTTTCTAGAGATTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGC 

AGAGGGACCTATGGAGATCAGCCTGGCCCTGGACAGCACCAGGCAATGGG 

GTTGCTAGAGAGGTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCA 

AGAATCCGTGAAGACTCAGACTAA' l ' l"I"l"l l i"l"l"l"l"rr l 'GCATGAGGATTAG 

GTGTTCCTAGGAATTTCAATGAGAGCAGGGTTAATGAAGGAATGCAGGGT 

AGGAGAGCTGAGGGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGC 

3TCAGTATGGCTCACCTGCTTTCCTTGTATCTACTTAGCAGATGATCCCA 

ZCCCAGGCCTCCAGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGA 
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3GGCCTGGAGCAGTG7A^oGAAGACAGAGTCTTAAGAAATT3CATtAAL . 

G7CA7GG7GC77GGCAAGTG7CGTCATCCTATGCCAAGCCTGATC7GAAG 

3GG7GCA7GC7CATAGG7AGC7GC7GCCCAAGAT7ACAGCAGC77C77CA 

A7CCCAGA7CCATGC7C7CC7ATATTCA7T7T7CCAGGGGTTCC7G7CC7 

7 CGACAG7GA7GAGA7GCAGAA7GAC77A77GAG77A77C7 C C7GA7AG7 

TGCCAAC7777CCAAA7GACAA7GGGGCA7GGAGC77GAGAG7GGAAA7G 

AGGC3C7AGGGA7AGCG7GC77AGGAAAACAC7CCCAGCC7GA7G7AA77 

275GGGG7ACAA7GGCA7777CA7CA7CAAGAC7GA7G7AAAGGG7GAC7 

AGCAG7GAG77GGGGGTGACTCGCACTGGGGCTAGG7T7CTGA77C7GCC 

7AA7CCAGACAGAGCAGAAGCACTAGTGGGCTGGTAGAGGGCC7CCAGGG 

CC7CAC77AA7G7CC7GGAAAAACAGC7CCAGA77G77GG77CACG77C7 

GAGGACAAGCT7GGGTACTACAGGATAGAGAGAGTGGTG6GAGAIGCCG7 

GGCC7GCCC7GCTGATGCCTGCCCTGCCATTCCTGCG7G7GA7G7C7C7G 

GGGCA7C77GCC77CCC7GCCCAGACC7G7AG77CAGC7GAGGGCA7G7G 

GAGGCCAAA7GGC77C77AGAG7G77AC777CC77GAACAGC7C7GC7GG 

GAGAAC73GAGGAGC7AGC7AG7CACGG7AAC7GCAGCAG7CAAAGGA7C 

G7CCCGG7GGAGG7GGGG7GGAAAGG7AGAGAAAGAGAACA7A7AGCG77 

TTCC77GGAGA7G7G7GGGCA7G7CA7AGAGGAAA7ACCCAA77CC7GAG 

CC77GAGCCC7CCAGGAAACC77GGAA7A77AGG77AG7CA7CCCCAAGG 

AAG7C7AAGAA77C7GG7C7CACCCA7C7CC777AA77CCCACAA7GA7C 

C7ACA7GA7A77AAGGAACACGGGCCAG7AACCC7CCAAGCAA7GGA7G7 

3GTGG7GAAGT7TGACC7CATGATGGAGCGGAGGTTGGTTTGAAACC7AA 

GAA777AA777A77G777CAAAC7G77C7CCAC7CAGCG77A77AAAGCA 

7ACA7AA77GACACA7AAAAA77G7A7A7G7C7ACGG7G7ACAA7G7GA7 

G777CGA7C7A7G7A7ACA77G7GAAA7GA77ACAACAAGC7AAA7AACA 

7ACCCA77CA7CG7G777CAAAGGAA77AAAC7CAAGCACAAAAGAGAGG 

7GC7G77GAAGAG7AGGGC7GC7C7A7C7AAG7AG7A7G7C7GGGG77G7 

CC7GGA7CAGGG7CC7777G7GC7AG7AA7AAACCAGCCC77C7GGGGC7 

GC7CCAC777CCC CACA7777 C77C7GG AGC C7 CC C7 AAGAA77 AGG ACA 

7GGCCAC777C7C7GCA7AGGC77CC7AC77CAACAAGGACAGGGC77G7 

GC7GCCCCA7GCCAC77GAG7G7CCC7ACAGCACAGAGC7GAG7GCACAC 

7GGC7GAG7GAGGAAA7CCCCCAGA77AA7C77GG77C7AAGCA7CA7GG 

C7G7A777CACACG7A7A7GAA77ACAAA77ACAGCA7AG7CGAA7AAGG 

A7T777G7GC7ACAAC7GGAA7CCCAGA77A7GCAAA77GGA7AG7A7AA 

7A77GAAA77CC7AGGAC77777A77AG7777AAAAAA77A7ACAAGC77 

AGAG7AAGAAATTAAACAGTGCAAAAGAATTCACTGTGAAAAGTAAAATG 

37C7G7C7C7GCTGAGAGACAGATATTGCAGCCCAGATACTACTGGGG7C 

PA7AG777CC777AAGCA7GCCA7777GA7GG777A7GGGAC77ACAGC7 

:AAGAAGC77GACAC7AGGG77GA7C7CAGAAAA7CA77G77GCAGG7A7 

7AGA7A73ACCG7C7CA7AAAGA7ACACACACAGACACAGCGA77GGAGA 

TA77CAC7GGGGC77A7GGGC7GC77G7CC777C7GC7C7G7GCC7AAG7 

TGGGC7CAGAG7AGCC7GGCA7CGGC7G7GGGGAGAA7GC7GGCA7GGGG 

77AGCAGGAGCCCAC77AACA7G7CC7AAGCCACC7GGAAGAG7CC77CA 

AGGAGACCAGAC7CCAGAGGCCCTAAGGAAGGAAGGAC7TT7GCCCG777 

77AGG7A77C7AG7CCCAGAG777AGGGAGGAA7GG777GGC777GGG7C 

37G7GCCCC777ACCGAG7GGGA7GGGA7GTGCCCA7GAGC7G77GAGC7 

GGC7C77GGAGAAGACAGCAAAAGCGGGAA7AAGAGG7CAGGAAGC7G7G 

TGG77G7AGGAAA7CCCAGCAGAGGGCC7GGGGG7CAAAAG7GG7CATGG 

7AG7GACGG7GGAGGC7GAGG7GG7AGAAAA7CAGAGGACAAACCCCA7G 

GGC7GC7GG7GA7C7GACCGAGC7CC7A7GC7C7CC7GG77CA7777AGG 

C7C7G7AGCAGCAGA7GA77GGC7GG7G7GAGAGCAG7GCACC7GCCA7A 

TCAGGCAA7CCAAGACAAG7CCAAGC7ACGC7GGGAGGAAACC7GAAGGC 

AGCAGCAGG7AGAC7GGC7GAAGACAGACAGGCAGGCAAC77G7CAA7CA 

GA777G7G77777AAGGAC7777AAC7GGGGAGCCC7CCGGGACAGA7CA 

GA7GAGAG7GAAA7G7GC7CCGCCT7AGCC 

>Concig37 

3GCCG77CGCAA77C7G7AAAAGGGAGAG7GG7777A777A77777AAAC 
A7AG7CAAGC7GC7AAAG7A7A7GA7A7G7A7AGA7AGAG7A7AA77AAA 
7AC777CAAC7ACAGACAAAA7CAGGAGAA7GGAA77AAAAAACAA777A 
GAAA7GGG7AATGGCAGCATTGGG77GCGCCCACCCACGAGAAGGCAGAC 
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AC CAAGA77 C7AAGA7Cr» JACG7GGC CAGCACTTCAGACTTCAAAlTAGArt - 

77CG7GA77A7GCA77A77TT7C7CGGAAAG7777CAC77CAC7A7A7GC 

TACTTGACACTTGCTTTCCTAAGACATCCCTCTA 1 TTTTGAGATGACTAA 

Z7CAGCAA77CA777C7C7CACGCA7AAGC7G7CAC7CAACCCAAACCCA 

CCAAGCC7GCA77C7ACCC7CAA7AAGG7C77GG7G7G7AAAC7GACCCA 

C77CACC7AG77CC77AGCCC7C7C77GACCAGACA7GAC7C777CA7AA 

3 CTAGAC CTATAAAGT CAGGGCT C77AAG7AGC7GA7C7C7GA7AG7GCC 

AX&7G7CCCCCAC7G77CACA7777CCAC7CCAGC77C7AACAGG7GA7A 

GAC7GC77777GGGGG7AGGGGCACCAAAACA7A7AGACC7CA7G777GG 

A7G7AGACAC7CCAG777C7TTAAA77ACAAC7ACA7AT7AA7AA7GAC7 

7CCAAG7G7ACA77TCAG7CCAGA7C7C7CCC7GGA7CCCCAAAC777G7 

AAAACC CACCGCCTAG7TGATA7C7777GA7G7C7GACAGGCA7T7CAAA 

TTTAATACTGTCACAAACAAAGTTATTGATTTTCATCTC7GCATCTGTTA 

CAAATTTTTCTTACTTTGGTAAATAGCACCCCAGGCTGTGTCACTGCCAA 

GAAC777CCACAGC7CTTGGAA7AAAA7TCAAAA7A7T77CCAAGGCAGA 

AAGGCACAG7G7AA7C7GGCTCC7GCC7ACC7C7CCAACC7CG7A7CACA 

CTAGTCTCCCTG7CACTCACCCCCTCCAGGAGCTCAGGTATCCTTAAAGT 

77C7777C777777777TTi'TTT77777777GAAACAG7777GC7C7G77 

GCCCAGGCTGGAGTGAAGTGGCATGATCTCAGGTCACTGCAACCTCCGCC 

rCCTGGGTTCAAGTGATTCTTGTGCCTCAGCCTCCCAAGTAGCTGCAATT 

AGAGGCGCG7GCCACCACACCCGGC7AA77777G7ATT777AG7AGAGA7 

GGGGTTTCACAATGTTGGCTAAACCGGTCTCAAACTCCTGACCTCAAGTG 

A7C7GACCAC77CAGCC7CCCAAGG7GC7GGGA77ACAGGCG7GAACCA7 

7G7ACCC7GCC7CC77GAAG1'1 'I'l 1' 1GATCCAGACTCATTCCTGCCTTAA 

GG7C77GCA7C77CAGTCC7CCCC7CAAA7GACACC7CCA7 GAAGA CGCA 

A77ACC7G7AA77ACCG7GTCT7A777AG7CAA7G7GT7GG7777C7G7C 

7CC7CCAC7ACAG7GTAAGC7C7A7GAAGGCAGAAACCTTGGCAG7CCAG 

77CCCAGCACAG7GCC7AGCACACA7AGG7AT77AATAACACACAG7AAA 

A7TCACC7TTTAG7GTGCAA7TC7GAG7777GACAAA7GCA7CAAG7CA7 

7TAAG7C7GAC7A7TA7CAAGCTA7AAGA7GG77GCAACAC7A7CACTAA 

T7CCC7CA7GC7CCT TGG7AG TC AG7C 7CACCCC7AACGCCCCCC7CC7G 

GCAA7CAC7GA7CCG7777T7G7C777A7AG7777GG77777CCAGAA7G 

CCAA7AAC7AAG7TTTGAATGAATGAA7GC7A7TAACTC7CA77TC7GAC 

7CCAGAGCAACA7CCA7GCAA7ATTTA77A777CAGCCCCAAA7AC7GCC 

CCC7CACC77CAC7C CAAC CACC7AC77GA7GA7ACAAGG7GAGACA77T 

GGCA7G7GC77CC7CCATGTTCC7AGCA7777CCC7ATC7CC77AGCC77 

CC77C7AA7CA7AAACGAAGAG7GAAC777CCC777C7AAAGGCAAC77A 

C7CC7AGGACC7CGA7GCCATAA7777G777C7C7AG7AC777C7A7A7A 

7ACACCAAACAA77AGC7CCAGAAAGG7AAAGAC7CAC7G7G7GC7CA7C 

AC7G7G7C7CC7AGCGCC7GGCACAC7GCAGG7GC7GAAGAAACACC7AC 

AGAA7GAG7GAA7GAATC7CTCCC7CTC7AGAC7CC77C7C7777G7AA7 

CAAACA7G77CAACC7GCAACACAG7C7TATGACCAA7CC7C7G77GTC7 

GACC7AGGC7GAGCTCCAGGGCTGGGACCCTGACT7CCTTA7TCACCACC 

7 CAAGG7C7 C7GCAC7CAC77C7C777C7GC7CAGGA77G77777C77C7 

7G7CACCAG7C7TTTCTCAGAC7TAGG7C7CAGC7CAGACAT7GCTGTTG 

AAAG7AC77CTACTGATCCTTTTATC7AAAGrCAGCCATTCCAGCCCTACT 

C7C77GA7CATAGCACCCTGAATTAAGTTGTTTACTTAC7GTC7CTTCAG 

GAGGGCAAGGAGC7TGGTGGTGGTG77CAGGGCTGTACCAAGCTGTACCT 

7GC77CACC C7GC7ACAC7TT7TAGCAACCA7C7AATTT7ACA7GC7CCC 

77CAC7CG7CAGAAATTTCCTTAT7T7CTACTTCAAGCAGGTA7ACATA7 

G7GC77C7CC7GGGAGGC7CACCCAC77CA7GAGACTACATT7GG7CCTG 

GG7AGAAAG7G7ACAAAA7CCACTGGC7CAG7TT7AATCAATG7A7G7TA 

A7A77AACCAACC7GAGATCTTGAT7TCCACGCC7GGCTAATT7TG7AT7 

77TAG7AAAAACAGGG7TTCTCCATGT7GGTCAGGC7GGTCTCGAACTCC 

CGACC7CAGG7GA7CCGC7CACC7CGGCC7CCCAAAGTGC7GGGAC7ACA 

GGCA7GAGCCAGCG7GCCCGGCCTAAGA7C77GA77TCTACCA7C7GAAC 

7C7G7A777GAACTGACTGCTCCTGC77GAGC77ACTGGCCAAAACTTGG 

CCCAC7CAGAC7CACGGAAG7T7C7GGrrC77CCC7GGTAAC7TT7C7GA 

AC77AACCAC7GGT77GCTTGACAAGAGAT7ACCATCTTC7CACTTCCTA 

3C7A7G7GAAC7GAC77A7CTGC7C7A77GC7G7TCAGTC7AGCACGGCA 
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CTTATTGAACGAGTGTCTACATCTGCACCCCCTACTTCTTACTCATCCAT 

rCTGTTTCAATTTCTTAAAAAGAAAAAAAAAAAGCTATTGTAAACATACG 

A77ACAGAAAA7GA77TATAACA7G7G7A7G7ACCACC7AGCCCTG7CAA 

G7C77AA7A777G77A7A777GC77CAAA7C777777CAGAC7GTAG77A 

AAAATTACTTAGGAGCCATTATTTATGGCCTATTTCGTGACCTAGTCTTC 

77GA7 GG7 CAA777GC C7AA7CA7C77AAG77GCAAAAGC77AGAA77AA 

AG£AAAG7ACC77CGA7CC7C7GC7G77GCC77C777T7AA7A7T7GGG7 

TTGT7TGGGTCCCATTTACGGTTGTGACATCAGCTTGAGTTTTGGGAGCT 

G7C77377CAGAAAA7GG7TC7GGGGAACAGCC77TT7CAAC77GGAG7C 

ZAAAG7C7GTGC77777GC7GAAAGCCA77A77G77A7G777AX7ACCAC 

TGGTTCCATTTGGTCTTATGCTAGGGGTGCTTGGAATGGCTGAATTAAAT 

CTGCCAACTG7CAAATTAGGCCTCTGGCTTACGGCTTTTGACTTTTGCAG 

TACACATGATGTCTGAGGTATACAAACTTGGCTGGACTTCTGATCTTGCT 

TGATGTTTGGATGTCTGTTGTTATATTCACCCTGAAGCAAACTGGGGTAT 

GTTCTGGGTTTGGTGTGCTTCACTCTCTGTTCAG7AACAGGGTATGACCG 

TATCTTAG7TTCATTTGGTCTTTCATATTGACTGCTATTAACCTTTATAT 

CTTTGATGTTCTTGACTACTGGTTTCTTTGATGACTGAACTTTACTAAGG 

GTCCGAATAAAGTGAGAGGGAACCGTCCTTGAGGGTTTTACTCCTGGTCT 

TGCAAGA7C7GC7CC7C7AGAGAG77GC7G7GA7777AC7GGGAAAG7CC 

TGCTTTGTG7TTCTCCAACAAATTGTTTATTAACCCTATCTTTCAGAACA 

GCAC7A77AAC7GAACT7T7GCCCAAGGC77G777AGGAAC7AAAC7G77 

C77GG7T7GA77AXAAGAG7CAG7C7TTGGC77ACT7C7GG7ATATAAT7 

TAGGA7C7GGCTTCC7C7CAGG77C7G7TAAGA7A7C7AGCAAG77CTC7 

77G777G777C77T7AGAAAG7TA7CCAAAGA77CG7777CAACA7GGA7 

A77A77CA7AAAG7C7A7ACA777ACCA777CC77GA7C7G77AAC7GC7 

GC777G7AG7777CAA77GC7CTA7AX7AAG7GACCCCACAGG77T7C77 

GACAG7CC7CC7G7GGTGGAC7ATC7AGCT7CACAC7G7TGAAAAC7CTT 

GCTGAAAAG C77AGACTATGGG7TAGAAGAAACACA7T7TGAAG7CCGCC 

TTTTGCCCAGAAGTT7TGGTGGC7C7AAC77CAGC77C7GGGACCCTGCA 

GTAT7AGG7GG7C7GGGC7GGAGT77AA7GC7GA7GGACCTTT7AGGT77 

GACAGGCAAAACAACATGGT7GG7AACA7CA77777GGGTCTAATAG7CT 

GAAAAAACAAAGAAAA7ACATA7TAAAAAA7CC77AACA7ATCT7AT7G7 

7TTTAAAA7AATAACTG7GTTTAACACA7GC7AAAAAAAAAATCA7TTT7 

AGAA777CA7C7AAGAAA GTTGAA 7CC 7CAGAAA G7AAAGAAAGACTCAC 

7AA7AGG7AG7777TG7G77T7TTT7T7T77Tr7777TTGAGACAGGA7C 

77GC7CTG7CACCCAG7CTGGTG7GCAG7GA7GCAA7CT7GGC7CAT7GC 

AACC7C7GCC7CC7GGGT7GAAGCAA77C7CCCACCCCAACC7CGCAAG7 

3GC7C-GAC7ACAGGCGCA7GTCAC7ACACC7GGC7AC7rr77TG7A7777 

TAG7AAAG77GGGG777CACCA7A77GGCCAGG77GG7C77GAAA7CC7G 

ACC7CCAG7GA7CCACGCACCTTGGCC7CCCAAAG7GC7GGGA7AACAGG 

TA7GAGCCACCACACC7GTCCTAACAGG7AG7T777ACAACTTGAG77CC 

7A7CAGAAG7A7A7TAGAATCTTTTAGC77GACAGAAT7AAGCAGAGATG 

CAG7GAA7A7ACAAAACTTGCTCTTTCAAAAA7GAA7T7GCC7CAAACAG 

TAG77G77GAA7GCCTATTATATCC7AAG7GCCC7CCAAAGAACCC7GAA 

AAAA7ACA7ACA7AATGAACTTATG7TAGGGIACC7CCCAACAAATCTCT 

CC7AG7AC7T7G7ATAGCCACACTATATGTTT777AAACCACTGCCTT7G 

7AAACA7CACAG7ATCACTCAAGAACCTCTGTC7CATCCCTGGAGATCAG 

7GACAAGGAGA7AGGTGGCAGATGA7G7GAGGCC7GAGA7ATGCTGCCAC 

AGC7C7CAA7AAACATGTAACATCTTAA7AG7CA7AT7TGTAAAATCAGC 

CAGGACAGGG7TTTAAGGT7AGAGTC7ATGT7AATAA7AAACAAATGT77 

AG7CA7G7GA7T7AAG7TTGGATAAGAAAGG7AGGAC7CGA7TACAGAGA 

A7777GAAAAC7AGGGAAGGGAG7T7AGAAT7CA7A7GG7AAGTAA7TGG 

GCAAGCCAC7ATGAATTCC7GAGCA7C7C7CA7GAAAGCAATTACTCAGA 

AAGGAGAA777CACAGAGA7TTA7GGAA7A7G777CCAGGGTAAGA7ATG 

GGAA7GC7AGAG77ACCAC7C7A77777GA777GACAAA7A77G7GAAGA 

ATCAC7ACA7AAAC7TGGCGAG7A7G7AAAGGA7T7CTAACCAGAACCA7 

77GGCA77GAGGGCAAAGAAA7G7C7AC7C7GGA7GATAGCGG7GXG7G7 

GG7G77AC7AGGAG7GAAACAGCGGAG77GGGAG7GGGAGGCAGAGAGA7 

GGA7GG7A7ACCCACAATGGCTA7A7C7GGA77AA7C777GAGCACCAAC 

A777A7A7ACACC7CGGA7C7C7CCA7CA77GC77AC7GAAGAGG7GGAG 
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3GACG77GGCA7GAAAG c iTCCAAATGTGTTTTTTTAGTTGCTTTCTTA r 

A7A77AAAAACGAATTGATATAATCCACAAACCA7AAAAT7CACCATTrT 

AG7AAG7GCACACT7C7GTGGATTT7AG7ATAGCCACAC7ATTATACAGC 

AATCACCAC7G7C7AAT7CCAGAACATATTCATCACCCC7AGAAAGAGAC 

77GGG7TTAC77G77GGCAGTCCCTCCCCA 

>Concig3 8 

3 GT C7ACATGTGC7 CG CAAGATTGGATATTGAAATATCAGCAAGAAATTA 

AA7GACATAG7AG7CATTATGCCTAAATTATTGT7ATTTTTTGATTGAAA 

AAAG77GAATATT7CAAATATCAAGG7AGTAGTGAGATATAATAAAGAGA 

GAG7CAGT7C7AAGTATAGAATTGCTGATTCAGTTAAGCTCTGTTCTCCA 

ACA777GGGCCACATTGAAGAGACCATGTAGC7GCTT7CAGCCTCGGTT7 

CCTCCTTTGCAAAATGGGGATTACACTACCTGCCTCACAGAGATGTAAAC 

TTATGACATGTTATCATGATTGCCAGGGCCCACC7 G TTTTCTTTTAAACA 

TTGAAATCACTGTGCCTGAAACAGGGATTTCCCTGCCCTTTGTGCAAGCT 

CCAGAAACAGGAGTCAGCCTGAGTCCCGCAGCTAAGAACGTGGATTCTGG 

T.CATTT7CTCATAGCGAACACACTTCACAGGTCCTTCAAGGG&GTACATT 

7TCCTATAACTCACCTTAATCTCAGTTGAAGCCTCGTTTCTTATTTTGCA 

CTGTGGCCAAAAACTAAATCTCATTTCTTTCACGTAAACTTCAGCAATTC 

AATAATAGTACAGTCATTTTATGTTTCAACTGAACCAAGTCAGGGTTCCA 

CTCCTGCCTGCCCTT7CTGCTCTGAGGACATCCATGAAG7GGAGGGGGTC 

7ATGTAGCC7GGAGCTATTGGTGAGGGGCGATGGGTCCGTGGTGGT-CTTG 

GGGAACTGCGGGGCTGTGTCTGGCTGGTCTGGTGTCTGGTGATTGGCCTT 

GT7CCACGCGGTTCACGCTGCAGGACAGTTCG7GTCCT7C7TGTCC7AAT 

GA7CAGCTT7TAGGCTCACGGGCCTGTCTC7GC7GAGATATGGAATAGGA 

CAGCC7C7GGA7C77C777AAAC7C7CC7GGGGCCACAGGGGAC7C7G77 

TG7G7C7G7GCCCACA7AGGA7GA77C7GCCCAGACC777GC7GCCA777 

C77GC7G77C7GC7G77777AG7C7C7GGAGGGC77GCAG777CC77GGG 

G7CCC7G7GGAAGCAAAGCAAAG7CC7C7CCACGC7CAGA7G7C7AAACG 

7A7C7GGG7777A7CG7CCACCCA7CCCAGAGC7CAG7C7AGAGGAGGGG 

GCAGCC7TCGGG77C7C7CC77CC7CCCAGAGCC7C77CC777GCACCAG 

GGCAGCCTC77CC7A7C7G77GGAAAGGGC7G7C7GGT7C77GAA7A7AG 

AGT7GCAGGTT7GAGGGG7G7AGGC7GAGG7AAGGCAAAC7A7CACA7GG 

AA7AAAAA77ACCC7G7G7CAAGGAACAACCAGAGC7GGACAG77777AA 

A7G7GAAAACCAA7777A77CAGGAC7A7GGCGAGAGG7GAAG7AAGACC 

7CAG7A7AGAAC7GGGC7CAA77CCGAA7GCAGCA7GGGCAAA7GGGAA7 

G7A7AGCC7AGGAGCAGGG7GGGAACC7G7GGA7GAAGAA77AC7AAAAG 

GGCA7A7CAGGGG7GAGGGGGCG7CC7GGC7ACACCCAC7AAC7AC7G77 

GC7GAAGAAAGGCC7GG7GACA7CAC7GGGGAA7GG7GGGGGA7GAAGAA 

77CAA7CAGA7GGATATTGAGGATAAGGGGATC77GATAAACTGGC7TAG 

3AGGG7T777GC7AAAAC7G G 7777CA7AGG7AAG7CCACAGACAGG7C7 

7GGAGAAAG77CAGGGACC7ACGG777G77CGGGCAGA7GC777G7CA7C 

7G7CACAC7GGCAC7G7CACC7GGC777CC777AG7CCC7CCCCCCC777 

7777777C7GGAG7AG77T7GGGAGACCAGAGGAGCAGGGAG77AGGGAG 

AG7AG7CAGAAAAGGCCAGAGAAAA7AAGGAGG7G7C7G7AGGGAAAA7C 

C77AAA7CC7C7AA77AAA77AA777AA777A777A7CTGGGACAAGG7C 

TCAC7C7G77GCCCAGGC7GAAGTGCAG7GG i tG7GA7C7CGGC7CAC7GC 

AGCC7CGACC7CAGGGC7CAAGCAGTTT7GCCACC7CAGCC7CC7GAG7A 

GCTGGGGCTCACAGGTGTGCACTACCATGCCCGGGTAATTTTTGGGTTTT 

I ' T T TT T riTr TTTTT rm T T rrTTTTTGTAGAGATGAGGTTTCGCCATG 

TTGCCCAGGCTTGGTCTCGAACTCCTAAGTGATCCATCCACGTCGACCTC 

CCAAAGTGCTGAGATTACAGGCATGAGCCACTGTGCCCGGCCTAAATTCT 

C CAATTTTTAAATGCTTCC CTGTTCCCTGTTCCAGATTTGGGATATTGAC 

7GCTGTTAAATCAGCGATTTCTCCCTGTGGAGAGGTAGCCAATAGGAAGC 

AACAAGAGTGAGGAGTCCTTATATCGAAATAGAGGGTAAGAGAAGAGACA 

GATGTTATC7TGGCAGTGATTTAAGAACAGCGAGTCTGTAAGCAAAGCAA 

AGCAAGGCTCCCAGGTGCTGAGAAACAATGGCTTTCTGGGGAAGCGTCTG 

TGT7CAGAACC7TAAGTTGGAAACATCTCTGAAGATGTTTGCCATGAAGG 

7TT7CTTC7GAAGTTGAGTCTTTCATCACTAGGTAGGCG7GTTTTGGAGT 

C7C7ATCAAACAGATCC7GTGTTTAT7AGGAAGCTGTGGT7CATAAAGCC 

7CA7GCTAAT777GCAGGTAGCAGGG7GGCCC7GGCCTGACCCGGGGACA 
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GAG7GGC7G7CC7CCC1 -^GGCAGGAAACTCTCTCCTGCCACCTAiST^w . 

CTG CAT AC C CACA7T7CAAGGGAGC7T C7GGG7GG7G AG777 AC CAGACT 

ATGGTCTGAGGTAGAGTTAAGCAAAACAAAACTAAACTGCATAAAGAAAC 

AGAAAGAAAATCAGG7GTTATAAAAACAATTTGGCATTTG7TTGTGTTTC 

AGC7CCG7GTCGATTTATTGCTTCCACAAATAGTGCCGATATGCACCAGG 

CAC7G77G7AAAACTGAAAATA7GTTTTTGGA7GTGCCCAG7CTG7GAG7 

AT7AAACGATGG77GA7T7GAAAT77GC7ATGA7TCA7A7T7C7GGGGG7 

AAGA7GCAGGAT7TCT77GGGGGGCC7ACGATGTGGCAT7C7AGAA77C7 

CAAAGAA7CAACCC7GG7GGGACCAGGAAGAGC7GAGC7GAGGCC7CTC7 

GC7CA7G7G7AC7TACTGGAGATCATGGAGACAGGTGAGCCTGAGTGCAC 

G7C7CACCAAAGCCACAGCAGAGGGGGAGGAGGCGGAAAGAGAGCTC7C7 

CCAT7TCTGAGAAGT7AA7GG7AACAATGGCATACATACC7AC777ACAG 

T7GAAAT7GGAAACCACAGCA77AAG7G77TCCAA7GAAA77TGGCAA7T 

TGGGAG777TCTGAGC7GCATTGGA7G7GG777TGCATGCTG7TAGGA7G 

AGCAAGAGA7GA7GGAGAACA7CTTCCTT7TGAGC77CCTC77GGACG7G 

GG7CAC7CCCAC7CATGGAATTAGAAAGC7TAGACC7AGAC77GAA7C7C 

ACCT7CTCAAGG7GCTCCCGGGCAAATCAC77AAGATCCA7CrTCT7CTC 

C7CC7GC7CC77C7CC7CCTTCTGAGT777TT77777CTT7CCAAAA77C 

AAATGACACGG7ACTGGTAGAAGAAAAGG7CCAAG7C7GC777TACAGCT 

CCCC7CATCCCCAAATGTAC7CCGACCCCAAGA7GACCATG77A7CA77T 

GAT7GACATCCT7C7AG77TCAAC7CA777C7T7GCA7GTA7A7GCACG7 

ACA7A7ACAC7AT7T7A77T7GCCAGGGG7CACCG7T7AGC7GCA77AA7 

77CT7A7AAAA7AA7C7A7A777AC77A7GG777ACG7AAAACAACA7AC 

ACA7G7AAG7G7A7AGC77GATAAG7C77CA.CTGTAAACCAAAAA7AAAA 

7TCGAAGCCCCCCCAACCGTCTGAATGGACCCC7CTTCTTGGCCAAGAGC 

A77CCAAAG77AACC7GAAAAAAC7AG7TCAGG7CA7GA7GGAAGGGAAG 

G77GGACA7GCCCCAG7A7ACCC77C7CCCT7T7GGAA77CAGGAAAAGC 

7GACCAGCA77AACA7CAACACAGACC7TA7G7C7GA7AGGAAAC777GA 

CAATC7A77CCC7C7GAAGC7TGC7ACCCGGAGGCT7CA7C7ACAAGATA 

AAACC7TGG7C7CCACAACCGC77ATCATAACCCAGACAT7CC777C7G7 

7GAGAA7AA7T7ACC77G7AACC7GGAAGCTCCC7GC77CAAG77CCC7C 

AC CT7T C CAGATTGAAC CAA7 G7 AAAC CT7ACA7GCA7TG A77GA7G7 AT 

7ATG7C7CCC7AAGA7GAATAAAAGCAAGC7G7ATGTTGAC7GCC77CAG 

CACAGG77G7CAGGACC7CC7GAGGCTGGG7CACGGATGCA7CCT7AACC 

77GGCAAAA7AAAC7GTCTAGATTGAC7GAGACC7A7C7CAGA7AC7G77 

GGG77CAAA7A7ATAAC77ATGAAAC7AATACACAAA7CAAG7CA7AGAA 

7A7T7CCA7CAC7CC7CA7CTACCCCCAAA7T7CCT7A7GCG7C777GCA 

37CAACC7CCCACCCCA7CCCCAGGCAACTGCAGA7C7AC77777G7C7C 

TGCACCT7CAAC7GACCC77TC7G7GA777CATA7GAA7GGAA7CATGCG 

C7GAGCAG7C77TTG7G7C7GGCT7C7TTTGC7CAGCA7AA7G77777GA 

3G777G7CCA7G77tTrGTGTTTG7CAA7GGT7AA7TTC7CTCCA77GCA 

GAG7AG7777C7A7TGTACATG7GTACCACAA777G7A7A7CCA77CCA7 

TGCTGATGGACA77TGA7TTGTTTCCAGAT7T7GGCAA7TATGAATAGAG 

CTACCATGAACACCCAGG7ACAAG7C7TTG7G7GGAC77ATGTTT7CAT7 

7CTC77GGAATGGAACTGTCATATCAATAAGTATATGT7TAACTT7GTAA 

GAAAC7GACAACAAATTATCTGCGATGGTTATGCCATTTTGTTTTTCTAC 

CAGCAA7ACACGAGCATTTCAGTTGCTCCACAACTT7GCCAAAAC77GTT 

TTCTTTAATT7GGACAT77AAG7GG7GTACAGAGGCA7C7CAT7G7GG77 

C7AG7777C777GCCCTGATGACCAA7GGTG7TGAACATCTT7TCA7GTG 

CT7TTTGACCATTTACATATCCTCTTTTGTGAAGTGTC7GTTCAAATATT 

TT7GCCCATT7AAAACATTTGGGGGT7TGTCTTA7TA7TG7G7TGGGAGA 

G7TCCA7A7T7ATTTAT7TATTGAGA7GGAGTCTCACTCTG7TGCCCAGG 

C7AGAG7GCAG7GGCG7GA7CT7GGC7CACTGCAACC7CCACTTCC7GGG 

T7CAAGCAA77CTCCTGCCTTAGCCTCCTGAG7AGCTGGGATTACAGGCA 

7G7GCCACCACAC7GGCTAAG77TTTGTAT7TTTAGTAGAGA7GGGGTTT 

CA7CA7G77GGCCAGAC7GGTCGCAAA7TCC7GACC7CAAGCAATCCACC 

TGCC7CGGCCC7ACAAAG7GCTGGGATTACAAGCA7GAGCCAC7G7GCCT 

GGCCCA7A777A7TTTTTA7TCT77A7T77G7A7ACAAG77C77GG7CAG 

A7ACAA7AA7ACC7GGTCAGATGAGATAATGAG77GGAAAATGC777GCA 

AA7GGGGGAGAA7AAT7TAAA7G77AT77A777A77AAGAGCAGAGGCCC 
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TTCCTGTTGCGGTCACrf-jAAGCCGTTTGCTTCTrCTGCCTTTT^TAj^ww 
AGCAGAG7CGAGC7ACACAGGC7G7C7G7G77GGC7GC7A77AG77AA7C 
AGAGAG77777777TTC77GCC77G7CA77C7AA777G7GACACA7AA77 
AGCCACAA7A7G7G7777CAG77G7GACAC7GGCC7GGGAAACCAAGGGA 
TG777AGAG7GGA777CC77GA7777GCAA7AA77G7G7G77777C7GCA 
7C77C7G77AAACACAAAT7CA7GGAAGCAAAACA7GGAAGCAAAG7ACC 
C7GGACA7CCCCCC77C777A7GAAA77GA777C7CTTAAA7G7AA7G77 
TwC77G 77 C C CTTACTT7AAAAGCAAT7TAAGAGTTTAT7GAGAAAGTGA 
GC C C7GGAAAC A7AGA7GCA7AGAGAGAAAA77C7ACCAC C C7CAGG7C C 
C7A77G7C77C7C7CA7AAAG7G7AG777CAGGGCC7777AGAAG777C7 
777C7GC7C7GA777GCA7G777G7GAG7G77GC7A7777AAG7A777GG 
A777GG7C7GCAAA7CC7ATGAGAGA7GGCAACAGAG7AGGGA7C7CAAA 
GCC7GCAGG77G7ATTAAGTCCAGCAGGGCC77G7A777ACAACAGAGGG 
TCC77GAAGACA77CCA7A7A77A7GC7AGGGGAG7GGCCAAGCAAAC77 
TAA7G7G7CCC7A7GG7GGGA7A777GGGG77AA7ACC7GCCC77C7CT7 
AA777C77777C7777Cl T lT777C77777C777C77777777Tr7GAAA 
7G7AG7C77GC777GTCACCCANGC7GGA77GGAG7GCAG7GG7A7GA7C 

AGCC7CCCAAG7AGCTGGGAC7A7AGGCACACACCACCA7GCC7GGC7AG 
T77777777777777T7GAAACNGAA7C7GGC7C7G7CGCCCAGGCGGGA 
C7GCGGAC7GCAG7GGCGCAA7C7CGG 

CGC7CGCA7CCC7CA7A7CCA7GAG7G77C7G7GGGCCC7GCC7C7GAAA 
7AAA7CC7GCC777G7C7CCCAG77CAC7CCAGCCACCCA7CC7GGGGC7 
GCACCG7GC7CC77CCAAGCCC7C7CCC777CC77CC7GG7GC7GCCTG7 
CA7G7CAAGCA7A7GCA7CAG7GCGACCAGGACA777GAAATGCAACCAG 
TACAA77GGGCGCGGTrA7GCCTACCAG77777C77CC77AAACA77T7A 
TA777A7G777GAAAGCA7GCCACC777C77CAC77GCCAAC77GACAGA 
777A77AG77GACAACA7CCGC7GA7AGCA7CAG7AA7AAG7TAAT7G7T 
77TGCACA7G7AGCTTTAATTA77C7CAT7A7CA77TATAGGAG77ATTC 
77TG7AAAG<3G7AAC7GAG7T77CCAAAACAAACAGAAA777GGGG7GGG 
CCCA7GGAGCG7GAC7CA7GAAA7CAGA77C77AGAAGGACCTCGGCAAG 
7C7C7GGG77GCTGT7AA7GAGCC7GGC7GGC7GCCAGGGG7G7G7C7GC 
CCT77A7GAGGCCACCACTGTTCAAA7GCT7GCC7GCAGCA77AC77GCC 
7AGG7AG7GC77G7T7C7ACTGAAC7G7CAGGGA7CCAAT7C7T7G7GG7 
G7AAG7AACAA7AC7CAGA77CACAAGGAA77GA77AA7AAGCCAGAATG 
CCAA7G7A77ACA77777GA7GAAGACCA7A777ACAG7GA77G7A7C7G 
C7CAAGC7CAAA77AGGAT7AGAG77C7GACAAA7ACA7A7G7GAGAAG7 
. A7GAGG77AAA7AC7TGAAAT77GGAC7777C7AGAAAATC7GAA7G7GA 
T7GCCA77CACA7ACCTT7C7GGGGA7GA7GA77GT7G7AC7TT7A7777 
AAAAGACA7AGAAAAC7AAC7TAAGAA7CAGA77GC77GGC7GGGCACAG 
TGGCTCATGCC7G7AA7GCCAGCAC777GGGAGGCCAAGG7GAG7GGA77 
GC77GAGC7CAGGAGTTTGAGATCAGCC7GGGCAACATGGTGAAA7CCCA 
7C7C7ACCAAAAA7ACAAAAAAAAAAAAAAAAACAACCAAAAAGAA7AAA 
77AGC7AGG7G7GA7GGTGCGTGCTTG7AG7TCCAGC7ACT7GGGAGGA7 
GAGG7GGAAGAAT7GC77GAGCCCAGGAGGTGGAGG7TTCAGTGAGCTGG 
GG77GCAACAG7G7ACTCCAGCCTGGGCGA7AGAGTGAGAC7CCG7CTCA 
AAAAAAAAAAAATCAGATTGCTTTAT7GCTGG1TITC77TC7AAAACTGA 
GA77GGG7CCCATCATCCCC7GGCCCCCA7TGGTTAATGGT7CCTCCTT7 
G7C7A77GAA7AAAA7ACAGATG7CTG C 7777GGCAACA7GGT7GAATG7 
AGACAC7GCAGGGTC7TCCTGAC7CAAAA7GAGTAAGGC7TAGA7AAAAC 
ACA7777GAAATGCATTTCTGGA7GAACAGCAAGGAAAGGAGATCTCT7A 
AAA7CC7C777C7GTTCCCCTCTCCC7ACCCCC7CCAAGTGGGC7TAAGT 
AGGAAGGG7GG7GAGCGGCAGGTAAACACACGTCAAAGGCAGTCTTCCTC 
TC7GAGGGAAAACAC7TG7A7AAGCA77GCAA7CAATGGGCC7CT77AAT 
7A7G7GCCAG7GGCAAGAGCGGG7GC7GAACCCAGGGGCC7GCC7CAA7C 
CGGGGCC777GAG^CAGAA7AAAG7GG7C7CAGG77G77GGCA777CC77 
GCCC77CCACCCGAAGCAGACACAAA7CC7C7C7GGAGGCAAG77CCCCA 
A77CAGCCAG7ACAAC7CCCACAGAC7AAGA7CAA7CATGTACAAGC7CA 
'CACCAAACACACAGAGCAA7AAACAAA7TCA7GAG7GAC 
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GTGAATGAGAATAAACAw J^CAATAACCACCAGCTGGGATGCTCTAAG . 

C77CAGC7G77AGAA77CC7GAA7A7AGAA7AAAAC7GCCACAA7GGO\A 

ACATGCATCTAGTACTTACTGTGTGCTGGGTTCTAAGAATTTTGCACATr 

GTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCCTCTT 

AA777GCAC7AGATTAAAAGG7AGAAAGGAAGAGGCAGCTA77C7G77C7 

TGGCTG7GC-CTCTGGCAGCACATGCAAAATGGGCAG7AACAGTGGCAGTC 

ACAGG7AAG7AGCC77C7CACAG7G7GGAG77AAAGGCA7GGGAC7GAGA 

CGAGCAAGG77 C C7AAAGGGACAG7GGC CAG7AGA7GACCAGGGGC7AC7 

GGAG7GGC7GCA7GGC7C7G7GGAAGC7CAGAGGAGCC77GGG7CC7GCA 

GG7GCAGTAGCAGC777C7G7AG77CC7GA7C7C7GGG7CCCACAA7C77 

CCCCG77777GC7CC7CCAC77C7AA7777G7AAC7GAC77CCC7G7G7G 

TAC77CTC7C7C7GA77GAAA7AGCCAGAC7GG777C7G777CC7GA7AA 

GACA77G7CTGG7ACGAACACAG7AAC7CA777AA7CCGA7A7C7C7A7G 

AAGGAGG7ACAA7AA77A77CC7A7777ACAGA7GAGGAAACACAGCAGA 

GAAA7AAAGTCAA77G7C7AAGG77GCACA777AG7CAAGGGAAGGG77G 

A7A7AACA7A7AA77A777AGAAAACA7C7AAGGAAA7AAAAGGCA7AA7 

T7AAAAA7AAAAC7AGGCAGG777AAAAAAA7GAAG7AA7C7A7AAG7AA 

AAAAG7A7AA77G77GAAA7ACA7A7C77AG7GGA7GGG77AAA7AGC7G 

AAGAAA7GA77AA7GAAC7GGAAGG7AG77C7GAGGAAA7CAGAA77CAG 

CA7AGATAGAAAAAA7GGGAA777ACAAAAG7ACACAGGAA77A7AAAAG 

AGG77AAA77A7AGGGAGGG7AGAA7GAGAA77AACA77GG7C7AAC7GG 

AA7777GGAAGAAGAGAA7AGAGAGAA7GAACAAGGCAA7A777AAAGAG 

37GGC7GAGAA77777CAGAACCAACACAAAC7A7GAC777ACCAG7AGA 

GAAAACAA7G7ACAC7GAGGAGGA7AAA7AAA7A7AC7A7GAACAAA773 

TAA7AATAA7AC7CAACAAAGACAAAGAGAAGA7G77AAAA7CAGCAAAA 

AAAGAAAG7CAGAC77AGAAAGAAA7GACAA7GGCAGAC7AC7CAACAAC 

AACAA7GGAA7CCAAA77CGG7CAAACAG7A7777C77CA7GC7AGCA7A 

7AGC 

>Concig40 

GGGAG7CCGC7A7GC7CC7AAAGA777GCACC7C7GA7C7GG777G7AG7 

7AG7C7C7777A77GC777A7CC7AC7CAAC7AA7777777AG7GCC7G7 

7777777777777AA7G7G7G77GA7GAC7ACAA77C7AAAC7CA77C7A 

C7GA77CA7GGG7GC777AAAA7C7GAGCAG7C777CGCA777AC7GCC7 

G7GA7GGCCCA7CCCACCAGC7AAAG7G7G7GGCCAC7GC77ACAGCACC 

A7G7GA7AACGAG7AAGGGAGAGA7GCCGCCCAGAC7C77C7AGGAGCAG 

CCAG7AGGACC77CCAGGGG77GCAAGCAAACCACAGCAA7A7G7GGAG7 

G7GGCAGAGGA7GGCCCCAAGAGGA7G7GGCAGCGGC7AG7GCAGC7CAG 

C77AG7C7GAGAGGAAA7GC7GGAGAGGAGAGCCCAG7C7G7ACAGGCA7 

GACAGCCACAAGGAC77CAACAGC7AACA7GGC7GAG7GGAC777A7G7G 

C7 A7 C7CA77 CAGAAAACAGGAGCAA7CAGAAAGGAG7CAC C7C C7A777 

G7ACC CCAGG AA77GC7AACC7AC77GCA7C7GAA7GA7G7CCA7CAC77 

CCC77CA7CACC7CC7C7GGGGGCrC7GCAAGGA777GAC7CC7GCA77A 

G7GA7C7G7C7CACC7ACG77G7GA77CACA7GAAC77AC7AA7G7GC7A 

TG7GACAAC7ACCA7C77AAACACAAAAACCC7C7777GA77C7G7GGC7 

CCC7CCAGC7ACCCC7GCATTTC7C7G7CCCCC7GCCCCGTC7C7GCAC7 

CAC7777A7777ACAGCAAAAC7AC7CAAGGGAG7C7CAG7GC7CC77GG 

C7CCA7G7C7CCACC777CA77C7C7CCTCAG77CAC7CC7G7CAGGC77 

CCG7CC7CAAGC7C77C77CAC7777G77C7AGGGCCGC7GACA7CC7C7 

77C77GCCAAA77CAG7GGCCAGG7CC7CAC77AC7CAAC7GC7CAGCA7 

TG77GGGCC7GG7GGACCACA77C7CC77CACCCACC7777GC7GC7C7C 

7C77C7C7CCAGA7G777C7C7C77C7CAC7GGC7AC7CC7C7777G7C7 

CC777G77AGC7CCA777C77CC77CCAACC7CAC7G7GCrGG7G7GCCC 

AG7GC7CAG77777AGC7A77C7C7C7777CCAG7GGCA77CA77AGA7G 

G7A7CA7G7GACCCA7GGCA77A7A7GCC77C7ACA7GACAGT7AC7CC7 

GA A7A7 G AA7C 7CAGGAAAGA777GGA777A77777AA77AA77 7 7777 A 

AA7777A7777AA7AAA7GAGG7C7C7C7C7G7CA7CCAGGC7GGAG7G7 

AGTAT7GAG7GATGTGATTATAGCTCAC7GCAGCC77GAACCATGGGCTC 

AAG7GA7CC7CC7GCC7CAGCTTCCTGAGTAGC7GGGAC7ACAGGCATGT 

GCCACCA7GCC7GGA7GACT77T7G7GTGTGTG7G7G7GTGTGGAGACAG 

GG7CT7GC7C7A77GCCCAGGCTGA7CACAAAC7CC7GGCC7CAAGTGA7 
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CC7C7CACC7CAGCCT'- -CAAAG7GC7GGGA77ACAGG7G7GAGACCA.W * 

C73GGC7AAGA77CAGA7777G7A77CAA77GAC7G777GACA7C77CAC 

77GGACACC7AAGAGG7A7C7CAAA7A77AA77AAC77GGCCAAAA7ACA 

3AAC7777GACCCC7GCCCCCACAA7AC77GCCCC77CCCCAGAC77C7C 

CA777C7G77AAA7A7CCCCAG77AC7CAACCC7CAAACC7A7GAA7GCC 

C777GA777C777C777CCC7CA7C7CC7ACG77GACGCCA7CAGC7AG7 

777G77GCC777A7GCCCAGAA7A7AA7CC7CACCACC77C7C7CC7A77 

3CC CGAG7A7AAGATG7CAG77777CC7GCACAG7CCA77GCC C7GACC7 

CC7GAG7GG7T7GC77CCAC7777GACA777G7A77CC7C777CCCCCAG 

3G7CAAT7777CACAGCAAGAG7GGCA x ■' i i' l 1 ITITI ' I T. 1 1' l iT'I'T T G 

AGACGGAG7C7CGC7C7G7CGCCCAGGCCGGAC7GCGGAC7GCAG7GGCG 

CAA7C7CGGC7CACTGCAAGC7CCGCC7CCCGGG77CACGCCAT7C7CC7 

GCC7CA GCC7C CCG AG7AG C7GGGAA7ACAGGCGCCCGCCACCGCGCCCG 

GC7AA777777G7A77777AG7AGAGACGGGG777CACC77G77AGCCAG 

GA7GG7C7CGA7C7CC7GACC7CA7GA7CCACCCGCC7CGGCC7CCCAAA 

G7GC7GGGA77ACAGGCG7GAGCCACCGCGCCCGGCCAAGAG7GGCA777 

T7AAAACCA7A7A77AGA7CA77GC7777G7G777GGGAACC7CCAAGGG 

C777GCA7CA7A7A7CAAG77GACACC7C7CC7ACCCAAGCC7GGC7C77 

7CC7GC7CC7C7G7CC7C7CAGCCCC7CCACCCA77G77CATGC7GC77C 

AGCCACACTGGCC7TC77GCCA7GCCACA777G7GC7AAGCCCACA7CCA 

A7C7CGGGGCC777GCAC7CGCA777CC7C7GC77GGCA7GC7G7ACCCC 

AGA7C777CA7GA77GGCAGC77C7G7ACA77CAGCCACC7GC7CAAGCC 

ACCC777CAGAGGGCC77CCC7GGCCACC7CACC7GAAA7AGCACC7CCG 

A77GCACCCA7CCGG77A77C7CCA7CC7G77C7C77GC7TGG7GAT777 

CCA7CAC7GA7GAGGAAA7GAACCA7GGAA7GC7AGGGC7GA7GACCAGA 

AC777CCCCCACCCCCACA77A77ACAGAGGAGGAAA7GAGG7CGGAGG7 

AAGA7GGGCCCAGGA777C7AC7CCCGCC7GGAC7GCAGGCACAGCAC7G 

ACC7CAGC7G7GC7CAC7C77GGCA77CACCCAACCC77C7A7C7CCAAC 

7GCCCCA777ACCAGAAAG7GAAA7GT7C7CAGAGACGG7GAGCCACC7G 

AC77GGACAGCAGCCCAGGGCCCC7GGCACCC7GC777C77CC7CCC7GC 

CA7CC777CC7C7CCAAGACC7ACC777CCC7G7GAT7C77GCCCACA7G 

C7GCA777CA7GG7TrrA7GACC7GA777C7GAGAGGGA777GAA7777C 

A7GA77A777A7G7AAGCAAA7CA77A7GC77A7ACAAA7GAGAAAAGGA 

G7GC77C7GGAC77CCCAGGGACAAAA7C77G7CAC77GGC77GC77TCA 

7A77GC7AA77AAGGACCCAGGA7G7GGG7GAGA7G7GC7AAAAGC7GAG 

AGGAGGC7C7GGAC7C7GAC7A7GGGCCCACACCCC7GGGCAGGCA7CAC 

AC7AG7C3777AGG7CA7CC7CAACCCAGC77CCAG77GAA7CAGA7G77 

7G7GAA7AAC7CAGCAAGGC7G7A7GGGAAA7GAAGAA7GAGG7GGGGAA 

3AGGC3737GCAGAAGACACAC7GACTTACCCC7C7ACC7C7AAC7AGGG 

7S7737AGCAGCCACCCACCCACCAAG7C7G7C77CCAGACCACG7A7GC 

r77CC7CCACC777GCA7C7777A7CrrrC7GCCAGCCCAGA7GC77GC7G 

AC7CCAGC CCAAGCC7A7AGGA7AAGC7ACAGCC7G7CCC7ACAGAC7AC 

GCA77GCAGAA7C7AAGACATCAAGTCAAGTTCGGAAGCAC77GCCrrC7 

CC7C7CCAGG7ACACAGGCTCTCCTGGAAAGC7GG7AGCAGC7G7GGAGG 

TG7GG7G7G77ACCTGCTGCAGG7GCA GAGAA G77GAC77CACAGCCCT7 

CAGAAAGAC7GCC77C77CCAG77G7A777G7G7AC77GCrrGGG7G7GG 

GGAGGAT7C7CAGC777C7CCAC7CAAA77A7CAGACCC777CCA777AG 

7GG7AGACCA777CCCTCGTCCAGGCCAAGGGCACA7AG7ACAGAGAAA7 

AGGGAG77G77ACCCAGGGAGAGAAC7TGGC7C7AAACC7G7AA7AGAAA 

GG7CAG77C7GG7CTGGAGGG7CAA77T7GATC777GGC7CAGA7CCAGG 

AA77GGAACCAAGGC777TGAACA7TrrAATGCAGGGGA77AAAAAAA7G 

A7ACGAG7CA77CACGAA7ATA777GC77AACA7C7AAAGAGATCCC7CA 

AAACACTAGAAAAAATAAGAACAAAAA7C7AA7AAAACAAAA777G77AA 

ACACA777ACCAA A1"11 1 l ' l 111 1 1 U G7AAAAA77CAAA7G7CA7AAA7A 

AAGC7AAAG77CC7C77GA7GAC7CGC7CC7C7GCCC7A77CCAC7CCAA 

G7AACCAC7AT7A7CAG7C77GCCAA7ACCC77CCAGACC7C7C7ACC7C 

7A7A7ACCA77AGAAGCACATGG7777GCA77GAGGA7G7GCAG7G7Tr7 

G7777ACG7AAA7GT7ATCAC7C7GTTC77G77CCA7AArr7GCC7Tr77 

C7C7CAA7GA777GC77GGC7A7C777C7A777CAG7AGCA7C7CC777C 

77777AAC77ACCA7TG777A777AAC C77GCC7C7A7CAACAGA7A7G7 
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T C7C CAAAA7GGC ^ A7ACCCAC77AC CCA7ACAGAGA7GA777GGAA7C7 
GGC77CCrCACAAGG7GAGA7GCC77CACAG777CA77C77CC7GGCA7G 
TCTTCCCTTTTGTATCTGAGAGAGCTGGCAGAATTGTGTCACTAAATCAA 
GqATAGAGGGTCAAATGACAGCTCAAGCTCACAGGCACCTCTGCTTTGTT 
CCCAGACCACCTGCTTTCCTGCCACCAGCTCTGTTCCATCTTATAGAATG 
G77GCCAC77GGG7G7C7GC7CCGACAGCCA7G7CA7CC777GCAC7GCA 
G77A7GAAGCAGAGAGAGC7AGGAGAGGGGCT77GCCAGCC7C7GCCC T A 
GC77GGAGAA777CAAAGAAGGAGGG7ATTGAGAG7GAGC7GCCGAAGAC 
TGGCAGC7CCC7CAAC7CAACAG77G7CC77CCACAAGAAG7CAGA7ACA 
T777777GGGA7AAAA7A777AAAAA77A77A7777A777C7GAA7AA7A 
TA777ACA7GA77CAAAAA7CAAAC7G7AGGCCAGGCA7GGC7GC77A7G 
CC7G7AA7CC7AGCAAT77AGGAGGCCGAGGCGGGAGGA7CAC77CAGCC 
CAGGAG77CAAGACCAGCC7GGG7AACA7AG7GAGACCC7GTA7C7ACAA 
AAA777AAAAACAAAAA77AG77GGGCA7GG7GGC7GA7A7GG777GGC7 
CTGTGACC CAACTCAAACCTCATGTTGAATTTTAATCCTCAATGTTGAGG 
GAGGG7CC7GG7GGGAGG7GA77GGA7CA7GGGGG7GGG77C7CCC77GC 
7G77C7CA7GA7AG7GAG7GAG77C7CACAAGACC7GG77A777GAAAG7 
G7G7AGCACC7CCCCC77CAC7C7C7CAC7C7CC7GC7CCGCCA7AG7AA 
GA7G7G7G7G777CCCC777GCC77CCGCCA7GA77G7AAG777CC7GAA 
GCC7GGCAGC7A7GC77CC7G7ACAGCC7G7AGAAC7G7GAA7CAG77AG 
ACC7C7T77C77CA7AAA77ACCCAG7C7CAGG7CA77C777A7AGCAG7 
G7GAGAG7GGA7GAA7A7AG7GCCA7A7G777G7A77CCCAGC7ACCCAG 
GAGGC7GAGG7AAGAGGA77GC77GAGCC7GGGAG777AAGGC7GCAG7G 
AGCCA7GAC7G7ACCAC7GC7C7CCAGCC7GGG7GACAGCGAGACC77G7 
C7CCAAAAAAAAAAAACCCAAAC7G7G7AAAA7G7G77CA7AAAAG7G7C 
77GC7CCCACACC7G7CCC7A7A7A7C77A77CC7CAGCC7CCGACAAC7 
ACT77A77CA777CTTA7G7A7C77CCAGAA7CAAAAAAAAAAAA7CAAA 
TACAAGCACAG7GGAA7G7A77GCCC77C77CCCC7CCC7777G77ACA7 
CAGAG77AGCA7A7CA7AAA7ACGG7C7GCA7777C77C7T777CAGC7A 
7CAGCA7G7777GGAGAGGA777CA7A77CG7GCAGACAGCA7G7A77AG 
7CAG7CC77GCA77GC7A7AAGGAAA7ACC7GAGAC7GCA7AA777A7AA 
AGAAAAGAGG777AA77GGC7CACAGC77CGCAGGC7G77CCACAGGAAG 
CA7GGCAGCA7C7GC77C7GGGGAGGCC77AGGAAGC7777AC7CA7GCA 
GAAGACAAAGCGGGAG7GGA7G7C77A7A7GGCAGGAGCAGGAC7GAGAG 
AGAGAGAGAGAGAGAGAAAGGA7GCCACA7AC7777AAACAACCAGA7C7 
TG7GGGAAC7C7G7CACGAGAACAGCACCAAAGGGA7AG7GC7AAACCA7 
7CA7AAGAAC7CCACCCCCA7GA7CCAA7CACCCCACACCAGGCCCCACC 
7CCAACA7CGGGGA77ACAA777GACA7GAGA777GGGC7GGGACACAGA 
ACCAAACAA7ACCAGAG7GC777C7CA77C7777C7A7AGC7GCC7AG7A 
T7C7A7G7CC777AC77CA777AGGCAG7C7C77G77GA7AGACAC77GG 
G77AC77CCAA77777CC7AT7ACAAA7GA7G7GCAA7GAATAA7777GA 
7CA7777CCA777CACA7GGG77A7G7CCA7C7G7GGGA7AAATC7CCAG 
GAG7GAAA77GC7GGA7CAAAGGGGAAG7GCAC77G7GATT7TCA7AG77 
AGCAAA7777G77C7A7AAGGG7CA7A7GAA777A7AG7CCCACGCG7AA 
TA777AACAG 7GGGGA 777CCCGACAG777GACCAACAAGG7C7G77G77 
AA AC777 7GA777 77G7 CAA7C7GA7GGGAAAA7AC7AG7A7C7CAAAG7 
GC77TTAA777GAC777C77A77ACAA7G77AAGCA7CA7777AC7C7GC 
CCAAGA7CAAA7AG7A7777C7777C7G7GAACAGAC7G77AAGA7CCC7 
TGC CT C77G7777GC7GGA77777G77C 7777 7 7 77CAAA7GTr77GAGG 
CAG77C777ACA7G7GAAACAAG77A7C7C777A7C7GGGG7G7GAG77A 
CAAC7AC7777CC7C7GGC7TG7777GCGC777GACTT7GC77C7GG7GA 
77CCCGCAA77C7GAAAG7G7AC77777GCATCA77CA7TC77A7ACACC 
CA7GC7C77G77CACGC7GG77CC7C7ACC7GAGGGCTT777C7777C7G 
C77C7A7C7GGGAACA7777777GAGAGAGAG7C7CAC7C7C7CGCCCAG 
GC7GGAG7AG7GCAA7GGCGCGA7C77AGC7CAC7GCAACC7CCACC7CC 
TGGG77CAAGCAA77C7CC7GCC7CAGCC7CCCAAG7AGC7GGGA77ACA 
GGAGCC CACCACCAAGCCCAGC7AA777G77GA777A777AT77A77777 
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'GAAC7CC . J 1 


3GC7CAAGCGA7CCACCCACC7CGGCCACCCAAAG7GC7GGGA77ACAGG 

337AAGCCACCA7GCCCAGCCCA7G7G7GGAAA7C77C7G777A7CCC77 

7AGGC7TGA77C77A7G7CG77C7CC7CCC7CC77CC7GGA7AC7CC7C7 

T377C7TTA7C77AC7C7AC77G7CA7G77ACC77G777C7GC77A7AAC 

TAG C7 GC 37 C7 C CTAT C7GAGGAGGGAC77G7GAC7G77C7CA7C7 C7G7 

ACTCCCAGCTCCTAGTACATAGCGCTTGCTCAACAGATGTTTGGTGCATT 

GA7AGA7AAA7CAC7GG7AGC7G77AC7ACCAG7CC7GAC7CCC7GCAG7 

GC77CAGC7GA7CC7G77CCAGA7G7GCAC7GAA7A7CC77C7G77GAAC 

AACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGGTGGCCAAGGA 

7A7T777AGG7AC777GCAGCAC7CAGCAA7GAGGAG7GGGC777AG7CC 

CCCAAGAAC7C7CACAGCCC7GGG7G7C777AC7G77CAG7G7CAAA7CC 

AAGACAAGTCAATGATCAGGAAAGACCATTTTTTTTTGTTCAGTGAAGTT 

TATTTCAGAATCATTGAACAGTATGATATTTGGTAATTTCATAAATATTC 

CCACTTAAAATGATCGGAGCAGATATATTTTCAGTCGTAATTAAAGGACA 

TGA777AAAGAG AGCAC ACCAG7CCAAA77GAAA7GA77CCAXAGC7A77 

AAAAAACTAGGGTTTTTTACAGACAATGATACTTTTTGCCCCCTTTGAAT 

AGA77AGACCAA7GAA7AAAACAAACAAACAAA7AAA7AAA7AAATAGGG 

AAGCGG77GC7CA7CAGAA7G7GGGAGCGAA7GACAGAGGG777C77AGA 

ACCAAA7G7GGCCG7GG777C7G7CAGGCG7GC777AAG7GAG7AGGAGA 

3G7GAGAGAGGCC7GGC7CAACAAAAGGGC7GGGGA77G7CCC7GAAGAA 

CCAGAGC7GAN77NCA7CAGGAG7AACANAGG7AGA7AG 

>Concig41 

C CGC G77GAGG77 CCACGCAG77CAAA77A7G7CCAAX7A7CAACA77AA 

TGCACA7777CAA7AGAACC7G77CCGGC7777C77AGGAGGGGGGCGGG 

GAGACG77G77C7C7GGGAA7AAG7G7ACGCAGGAGGC7GAGAAGGC77C 

A77CCA7AGCA77CAC77ACC7CCAGC7G7AGAG7GGGC77A7CA7C777 

CAACACGCAGGACAGG7ACAGAT777777C777GAGGCCCAAGGCCACAG 

GTA7777G7CA77AC777C77C7CC77G7ACAAAGGACA7GGAGAACACC 

AC7GAAGAAAGAAGGGGG7C7TG7GG77AGGGACACAGCAG7GCAGGG7C 

ACCCCAACCCCTAGGCCCCA7GAG7AGGA7ACA7G7AAT77GG7AGCC7C 

7GXGGGAAC CCACAG7GAGG77CCT7GGCC7AAGACACAGGA7AAC7TGA 

C77C7CACAGACAA7AGCAGGG7CA7777G77GA777AGGG777CCCC7C 

AAAGGCC7GAGGG777C7CAGAGCC7CA7AGCAG7AGGAACGGAGAA7GA 

AAGAGGG7C7ACA7777AAA7GCTGAAGGAAGGAAGGAAGGAAGCCA77G 

7G7CAC7GGC7GGCAA7G7GCCCA7CCACAGGAGCGGAACAAC77GA7CA 

A7G7GGAAGGAAAGGAAAGAGG7GAGGC7G7AC77C7GCCAGAAA7CAGG 

CACCAGAAC7G777CAGGAACAGAGAG7AGCCCA7GGGAAGAAAC7GGGA 

3AGGAGAGGC7GAGC7GGGAAAG7GGC7CCAAAGAGAGACAC7CA7777G 

A7C77CC7CAG7CACAGCAG7G7CAA77GGAGGCCC7GGGA7CAC7C77A 

C7ACCGGA77CCAAAGAAACAGGA7777C77GGCC7GGC7GAGAGCAAA7 

AGC77CCCCC7GAG7GAGGC7G7CC77CAAAG7CAGCAGCC77AG77GCC 

CACAC7CC7G7GCAGAGG C T77 G GC7AC7G7GGCACGA7GCCAGGCAGA7 

CACCACAGC7AAXGA7GGGT7CACCGCACTTGAAAC777TGCCCG77ACA 

GCGGAGAGA7A7AAG77CCTGC7GGGCGGTAAAA777CCC7ACAAGGAAC 

CACC7GGCA77GGGTGGGACGGATG77GGGCCAAGGGGGGAAGAC7GGGG 

AGGGGGA7GGACACA77ATCGCTCCAGCAC7C77G7TrCAGCC7CAACAA 

CAGGAAGAGAGAACCCACAGGCAG77AGGCCA7G7CCA7CAAA7GACCCC 

A7A77G7GGAAGAA77GACA7TGCAC7A7GCCCAAGAGACX7GGG7GGAC 

A7GG7CC7GGGAG7GC7TGAGCCG7CTAAT77C7CAGGG7CACAC7CC7G 

77AACAAA7GCAC7GGCCAG7GCAA7CAAA7G7GCCA777C7AGGACCAA 

AG777G7A7A77CC77777AA7A77777777CAC77G7G77GA7CA777G 

CC77AAA77AAC777C7ACT77G7TTAAAACA7GGAGAA7TAGCAAGC7G 

CCAGGAGGCCAGGCAGGGAAACCAGGA7GTrrCCA7TTACC77G77GC7C 

CA7A7CC7G7CCC7GGAGGTGGAGAGC77TCAG77CA7A7GGACCAGACA 

7CACCAAGC777777GC7G7GAG7CCCGGAGCG7GCAG77CAG7GA7CG7 

ACAGG7GCA7CG7GCACA7AAGC77CG77A7CCCA7G7G7CGAAGAAGA7 

AGG77G7GAAA7G7GGAGCACA7G77G777AGG7A7AAAA7CAGAAGGGC 

AGGCC77G7GAGGCGAGG7GGCAAAA777GA777C77GGAGGACACC7GA 

3CA7A7ACGG7CAAAG7C7GA7GACAACACCAG7AGGGA7GAAGC7GGGA 
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3 I2 GGGTGGCTAAGAAc -CTGGACCTGACACTATTAGACATGGG'Sri'C _ J: 
C . . CAGG7C7A77AC7GC7CAC7G7GGCCGAGCAACAGAGC7AC77AGG7 
AAAA7GG7GA7GG7CA7AACAC7AGCCCACAGGGAGG77ACGAACC7C7G 
37GACAA7G7AAG7GAAAGGCCCC7GAGAAAGAG7GAGGGAG77GCAAAT 

A77GC777GG777CCAGCC77C777G7777G7C7CCCCGC7GGGCC77C7 
ACCTTTAAAGGGCTTTGGCTCTGGGGGAATTGAGTTGGCTGGGGCTTGAT 
GAC77 Z GAAGAGGACACAAGTGGAGA7CTAC7GCC7GC7C77GGCTAAC7 
ACC77C77CAAAGA7GAAGGGAAAGAAGG7GC7CAGG7CA77C7CC7GGA 
AGG7C7G7GGGCAGGGAACCAGCATC7TCCTCAGC7TGTCCATGGCCACA 
ACAAC7GACGCGGCC7GCC7GAAGCCC77GC7G7AG7GG7GG7CGGAGA7 
TCG7AGC7GGA7GCCGCCA7CCAGAGGGCAGAGG7CCAGG7CC7GGAAGG 
AGCAC7GCGGAGAGAGCGAGGGAGGGAGCC7GGTGAGGTGGTCCTGCCAG 
GAACCA7GC777GACA7CAGAGAG7AGAAAGC7CAGAGAGGAGGAAAGGG 
C77GAAAGAA7CCCGAGC77C7AAAGA7CA7CCC7C7C7GGGCCAGGCG7 
GG7GGC7CA7GCC7G7AA7CCCAGCAC777GGGAAGCCGAGG7GGA7GAA 
TC^777AGG7CAGGAC77CAAAACCAGCC7GGCCAACA7GGCGAAACCCC 
77C7C7AC7AAAAA7ACAAAAA77AGC7GGG7G7GG7GGGG7GCACC7G7 
AA7CC7AGC7A77CAGGAGAC7GAGGAAGGAGAA7CGC77GAAC7CAGGA 
GG7GGAGGA7GCAG7AAGCCAAGA77G7ACCAC7GCAC7CCAGCC7GGGC 
AACAGAG7GAGAC7C7G7C7CA7AAAACAAAACAAAACAAAACAAAACAA 
AATAAAA7AAAA7AAAATAAAAAGA77ATCCC7C7C73AAGC7CAAGGAG 
37TAAGGG7G7ACTCAAGGGCACACAGCAGG77AGAGGCAGACTCAAGAT 
TAGAA7G7GGGC777C7GACACC77ACAGGC7A77C7777AGAA7AAA7C 
CCA777C7AC777G77CA7C777777G7ACA7GCCCCACC7ACACCA7AC 
A7G7A7ACC77C7C7A7A7C77777G7A7CCC7AA7GC7G7CACAC7A7G 
A777GC77777CA7GCAGA7GACCA7AACA7777CCA77CACC7A7GC7C 
AC7CAGCAAG7A77CAA7TITrC7ACAC7G77C77T7T777CCTrr77CA 
7AACAC7G7C7CA7AGGCA77C7GCAAA7CC7G7GAGAG7AC7T777G7G 
AAA7G77ACCAC7TTCC7C7TA77CAGAGAAGC7CCG7A77AAGGC77CA 
CTGAGG77GCC77AAGGCA7GA7AA7GG77CAAAGGC77GAAAGACAG77 
AAAGAGACC7G7AAG7GCACAAAAGAAAG77GAGCAGGAGAGAA777CC7 
GCC7GGAGCAGAGCCAAGC7GC7GGAAGAGGCAA7GGGGGCAAAGGCCAG 
GCAGACAAGCCAA7GGGC7CCTCCCACAGC7GCAGCCAACAAG77A7GCC 
AG7C77AAAAC77C7AAAGAAA7A7G77777AACAAGA77GAGGAC7GGA 
T7A7GAGGC7AGGGGAGGC7A7CACAAAC7GGAA7AAAA7AAAGCCAGAG 
AAAAG7GGC7GCC77CCAACCTGCACAAC7GACC7AGC7AGGC7GA7GGC 
TGGGCCACC7AGGAAGGC7AC7GAGCA7CA7A7AAAACAGAAGGGACAGC 
AGG AA7 A7AACA7GG C7C777G7AAGGA7GAG7 G7GAAAAA7GACCA777 
GC7GCCCAAA7GCCC77AGC7ACAAC7GAAAA7A777CAGAAC7GGAGG7 
TGCAGGA7GC7GGAA7C7CAGAGA7CA7CCAGC7CAGCCC777A777T7C 
AGA7GAGG7CCAAAGCGGG7AAAA7GAC77G7CAAGG7CAAACAGCAAG7 
GAA7GG7777C777CAAGTC7CAA77CA7C77777G777A7A7CA7CTA7 
G7C77G77G77A7AAGCTTCACCCCAGG7AGCAAAAAAC7A77C7ACTCA 
AAAGGGG7AGACA7A7G7TAGT7C7CAAGA7CA7C7C77GG7TrCAGAG7 
77AAC7CAAG7GA77GGCATAGGC7GAA7CCA7C7C77AAAAGGA7AA7C 
AAA777A7G77GAAGACT7GGT7G7C77CC7AC7A7GAAA7GGGAAACA7 
7A7CAC7AC7CC7CCCC7G7CACCACCAAG7G7GGCCACCACCACCAACG 
77AG7GAG7GAC7G7GG7GA7A7GA7GACCAAG7GGCCAGG7CAGCAAG7 
GG7GCAGCC7G7G7C7CACTGGAAGAGG77AAAG7C777C7AAAACAAAA 
7ACCA7GGCA7CAAAG7GGCCCAGAAC7CCC77C777GAGC77TCCCTG7 
G77AGAGCCC77CC77GGG77GGGAG77AAACCCA7AG7C77ACC77CA7 
C7G777AGGGCCA7CAGC77CAAAGAACAAG7CA7CC7CA77GCCAC7G7 
AA7AAAAACAGGGACA7G7C7CAA77A7G7C77C7AAACAGGTT7ATTT7 
TCC77CCC7G7G7ACAAGAC77GAC7G77CA7AAGAAAC7GCAAACAGCC 
TGCC7C7CAAAGC7GCC7GAAACACC7GGCAAG777CACAG7GA7A7GCG 
CAGAACAG7CCAGAAGGCAGA77C7AGGCC7GGCAGG7GGGCACCC7GGG 
TGC7CCC7G77GGA7C77GAGGCC7AACC7C7AGCCCAGCAGAG7CAGC7 
AAAA7C7GAGC7C7CCC7C7CCC7CCAAGCCACAC777GCAAAGGGA77C 
G77G7A7737GGGC77GGAA7C7777C7CCCCA777GCC7C7GCAGGAAG 


FIG. 3 (43 of 52) 



WO 99/06426 


PCT/US98/16102 


CCC773CAACAACACA TGGATAGCCTCCAGGTCCCAAGGCTGGAGC * 

377G7AA7GGGAAAG7AG7C777AAA7CAGA777AC77GGCACCC7G777 

3CCAC7GAAAGAGGCAA777AGGGGAAAAA7C7GG7C7CCAAGCACAGA7 

AACAC7C7AC7C77GAAAGAGGAGACC7GC7CA7G77AC7GG7C7CAGCG 

TCTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAGCTCAGGTACT 

TC7GCCA7GGC7GC77CAGACACC7G7G7AAAAAGGAGAAAA7GAG7GAC 

TTC C C CA7GACGGC7ACG77CA7G7G7GA777C7C7CAGCA7CGAG7GCA 

TGGCAG7CA7GCAAAGAAA7GA7C7C7GAG7AAA7GAA7GAATG7G7GAA 

AGAGAAG7CC777GGG7C7AGAGAAAAGCA777GC7AAACCAAACCCCAA 

CTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTTGACACTAACC 

TTTAGGGTGTCAGCTGTTAGATAAGCAGTATCCATTCCCAGAATATTTCC 

3GAG7CA7AAGCA77A7A77ACACC7GGCA77777GCAAAAAGC7GAGAG 

AGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAGAAAGAGAGAG 

AGAGAGAGAATATGCATACACACAAAGAGGCAGAGAGACAGAGAGACTCC 

CTTAGCACCTAGTTGTAAGGAAGATTAAAGTCATACTTGAGCAATGAAGA 

77GGC7GAAGAGAA7CCCAGAGCAGCC7G77G7GCC77G7GCC7CGAAGA 

GGTTTGGTATCTGCCAGTTTCTCCCTCGCTGTTTTTATAGCtTTCAAAAG 

CAGAAGTAGGAGGCTGAGAAATTTCTCTGTTGAATACCTGATTTCACAAT 

CAAGTTAAAGGAAAGGGGAAAAGAGTATTGG7GGAAGCTTCTTAGGGGAG 

GGGACTAATAAACTGAGATAATTCTCTGGTTCATGGAAGGGCAAGGAGTA 

GCAAACTATGACACATTTTGCAAATGTATCACCATGCAAATATGCATTGT 

T77CC7GACAA7CG77G7GCAG77GA7G7CCACA77AAAA7AC7GGA777 

7 C C C AC G77AGAAGAA7G777AAA777AG7A7A7G7GGG ACAAAG7GGAA 

GACACACAGA777A7ACA7GCACA7AC7777C77CA77CAC77C777G7A 

CTTAAGTTTAGGAATCTTCCCACTTACAGATGGATAAATGGGTACAATGA 

AGGGCCAA7AGCCC7CCC7G7C7G7A77GAGGG7G7GGG7C7C7ACC77G 

GG7GC7377C7C7GCC7CGGGAGC7C7C7G7CAA77GCAGGAGCC7C7GA 

GGAGAAAATTGACCTTTCTTGGCTGGGGCAGAGAACATACGGTATGCAGG 

GTTCAGGCTCCTGACGGAGTTGGGGCAACCCTGGAGATAAGCTCACACAA 

CCCTGCAAGACCAGGTGCTGTTACCCTAGCCAATCTCATGGATGAACCAG 

ATCAATGCCAGATGAGCTCTGCCTAAAATGATTTTTTGGTGAACTCTGAA 

AAGTGGAATATTGTTTCTGTAAGAATATCCATCTGAGACTCTATCTCTTG 

GTAATACCAAGAGTTATCAGTTTCTCTTTAACCGAGACACCAGCAAAGTG 

CCTGCTCCAGGGTAATGCCCAGGGGAGCCCTCCATTTGTAGAATGAATGA 

GAGTCCAGGTTATGAACAGTGCCTGGAGTGTAGGAACACCCTCCTTTGCC 

7C777GACAGG7C7GCA7CA7AACAC777777777777777GAGACAGAG 

7C7CAC7C7G7CGCCCAGGC7GGAG7GCAG7GGCACGA7C7CGGCCCCC7 

3CAAG77CCGCC7CCCGGGT7CACACCA77C7CC7GCC7CAGCC7CCCCA 

3CAGC7GGGAC7ACAGGCACC7GCCGCCACGCCCGGC7AA777777G7A7 

7777AG7AGAGACAGGG77TCACCA7G77AGCCAGGA7GG7C7CGA7C7C 

C7GACC77G7GA7C7GCCCGCC7CGGCC7CCCAAAG7G77GGGA77ACAG 

GCG7GAGCCACCG7G7CCAGCCTG7AACAC77CT7A7AGCAC7GAG77GA 

AACC77GC7CC7CC7GG7TCC7CCAGGAAAC7GAAA7C77777GAGCCAA 

G7C7AGCACAG7GCC7GGCATGTACA77CAGG7GG7AGAG777GC7GC7T 

GAA7GGG7GAA7GGGAA7TTGACAGCA77777AT7CAAA77AG7A7G7GC 

CAGG7A7CG7GC7CGCTCTGCATTA7CCAASGGAGTGAGCCTCTG7GCAA 

G7A777GAGACACGAGGGAAA7AGG77C7AC7G7GGGAAAAAGAGCA777 

CA7GGAC77GC7C7CCAAGCAGCCT7C7GArr777 A AT77GGC7CCCAG7 

A7C77GA7A7CAGGAG7CAGTGACAAGAAC7CCA7C7TTAG7AAG77A7A 

7777CCACAGGAAA7CTAAAAGC7G77CAACA7G7TAG777CCTG7GAA7 

77GA7AAGCCA7AA7CCAT7CC7AACAC7GAGCCCTCC7GAAA7TTGG7G 

7C7GG7CC7GCAGA7AGC7AAAAGCCC7G7C7GGG7GGCC7AGGGACTCC 

TC7G7777GCC7CCACAGGA7CCAC777GCAAA77AACCAC7GG77C7CC 

CG77G7AGGAAC7GCCACCT7CC7CAGAGCC7G7C777C77CC77CC77C 

C77CC77CC7C777C77777C777C7C7CTC7C7T7C7TrCTrrTC7TT7 

C777 C777 C777C777C777C777 C777CT77C777 C777 C777 C777 C7 

TCC777C777C7C777C7CTC777C7C7C7T7C7C777C777C777C7C7 

37 CC C7 C C C7 C C C7 C7C7 CTC777C777C77777 C777 C7777C7C7777 

C777C7C7C777C777C7CCC7CCC7C7C7C7C77777C777G7C7C7CC 

37CCC77C7C7C7C7C777C7C777C7C7C7C7C7C7C7CC7AGACAGGA 
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TCTACCTTTATCCCCC^jarrGGAGTC 

7GCA7GA7CACAGCAGCC7CAAACCC77CC7CAGAG7C7T7A7GCGGCAA 

CCAGCAGGG7C7GGAGGG77GG7GGC7C7G7GAAC7C7CC7GACAGAACA 

~AGAGA7G7C777GG7C7G77GA7G7GA77ACAAGC7GAACGAAGGAAGA 

T CAAAGC GAG7GACAGGAAGGGAGA7A7G CAAGGGAC CCGAGCA7CAGCT 

CTGAGTTAGT C CATT CTGCTTCTGGGACTTGGGATACAGGT CAGAAAC CT 

TGAGC77C7AC77C7CCATC77CCAA77G7AGCA7CCAGGACC7CAGAA7 

C7GCCAGC7AAGAGGAGCCC7AA7GA77G7C7GG7GGGA7A7GG7GGGAC 

CACAGAGA7GAAGACATGAATAGCTATTTGAATGTGAACAGCAGACGAAG 

AAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAGCACAGTGTGGT 

TGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGATGCTTTCGCTCG 

AGGGAAA7777GGAGCCATGGGGCAA7GCCCCC7GACG7AACAG7C7CCA 

CAG77C7GCCA7G7C7CATCC7GGCCC7G7AACC7GGACCCAAA7C7GC7 

ACCA7CCCA7CCA7C7CAGGAAG7GAAACC7C7TA7G7CAAA7AGG77G7 

GCAACG7A7G7A7CAGA7CC7GTC77CCCAAGGAGACCGC7CAGGCCACA 

GCAC77CC77CCGA7CCCCAA7GAGCAGAAAA7A7C7CGC7A7AAACA7A 

G77GGCAC7AAGGGAGGGAGTGGAAGAG7GA7GA7GA7G7AGA7GG7GA7 

G7AGCCCCAAGGAAG7GGAACAAGCAGAGA7GGGGAGC7GGAAA7GCCAG 

GA7GC7CCAGC77TTGGGGAA77A77CAGC7C77GAG7CACTAAAGCC77 

7C7CAGC7GCAAG77CCTCT7TACCC7G7CAGG7CA77C77CCAAGACAG 

GAGAC7GACA777A77CAAAGCAGCAAG7GCCC7GA7ACCA7C77G7G7C 

7AA7CA7GGGC77CGCAGCCAG7TA7CAAGG77GA7C7CA7C7CA77GG7 

C77CAA7CA7777GAACAAGAAGACAAGCAAAA7AA7CA7GGG77AG77C 

77A7A77A77G7G7GTACA7GCAG7GA7G7C7G77C7T7G7AG7GAGC7G 

77CC77CC77G77CACCC7C77GC77AGAACAGAAC7AAGCAA7C7GCCC 

CGAACA7777CCCCAAT7TCCCATC7CATTC77GGCAC7GGCTTCC7AA7 

A777G77C77A7GAG7CA7TT7CTTG7A7CA7T7CCATGAGTCCC7C7GG 

GA7C77AAAG7ATGAAAAATGT7G7G7G7ACCCACACC7G7CTTTG7GGA 

7A7T7C7C7CC7TTCCCTTC7GCT7C7GGGA77A7T7GGGAA7GGGCACT 

A7GA77777A7CA7A7CGC77CCAC77CC777A7GGCA7CA7C7CCAA7G 

GGCTTC77C7CCCTCTTGGATCCAGGTTCTCAGATTGGGGACATGCAGAG 

7CCAAGGAACA7TCCATTCTCC7CCC7GG7C7AGAACAAGGAGGGC7TAG 

ATA7ATGAGCAGG7GGCTGGGGC7GGCGAGC7ATGTAG7CTCCAA7GGC7 

7TTCCC7GA7G7CGGAG77GTTATG7CAG77C7GGGAGACCAATAAGACC 

77G7CC77CC7TTGGA7CCATCAGAAAAAGCCCC7GGGTGGG7AAGA7GG 

ATGGCAGGGCTCTCCTACTCTATGT C T T TTCTCACACCTAGTGGGTATAA 

GAGAGGGGACCACAAACAGAGGGGGCTCTGGTACCACTTATCCAGGGTCT 

GGAAACA7777C7G7AAAGGGCCAGA7AA7AAA7G777CAGG7ACAAC7A 

Z7CAACC77GCA7CA777CAGAAAAGCAG7CAGA7AA7ACA7AAA7GAA7 

3GG7G7GGC7GGAC77G7CCTGCGGTCCCC7G7C7TATATCA77G7A77A 

7A7CA7 777T 7CTTACATACAAA7TTAGAAGCAATAC TTAAAAAAA AAAA 

GCCG7CC777A77GAGCACCTAC7AAG7GCCAGG7ACCTTTT7TTCCC7C 

A7TA7C77A77AAC7CTTCATAA7AACCTT7AAAG7AGATAATAT7GAAC 

C A777G AC C7 A7GCAGAAAC7GAGG77GAGACAA7AAA77A777AAGACC 

GCACAAACAG7 AAAT GCTGGAAC TACGA CTCAAA7ATGGGTTAACTGAAC 

CAAAACCAGA7CTTTATTTCTCAC77TrAATrGrrACATATGTTTATTGC 

C7CA7C7CCTG7CCACATGGTGCCCATCGGCAGAC7CCTTTCTCATTCTC 

AG7GA77GAG7GACA7TCTAAAC7ACA7TGGCC7GGCAGATTCACCTC7G 

7CCCC7AAA7G777CCACA7TG7CC7777AGGA77GAGA7CC7C7C7G77 

CCCITG7C7TCCCTCCTTTCTTCT7C7GGCGGTGACGTGCTG7GTGAA7T 

7G7T7C777C7CCTC7CA GGGTA GTAC7GGGAC777CCAAATCAGGGTrr 

T7AG7GA7C7C7C77CCC7777C7GAG777C77CC77A77CCCA77CAC7 

T7C7CA7C7A7AAG7GGCAGC777G77GC7GGAGGA777CC777G7CC77 

77A77C77C7TTAAGACTTTGTCA7AACTGTCAAAAGCAA7CCC7TGAAG 

G7A7C7G7CCTTGGAATTGTGTGC7TATGA7GCTGAAAAA7ACTC7CT7C 

C7AAAGC7A77A7AAA7GC7 

>Contia42 

GGC7AGC7GCAAC7C77GAA7ACAAACACA77CAGACA7GCACACAC777 
C7GGC7 C 2CAAAAAGAAAAAAAAAAA7CAA7T7A7AA7AA77C7GA7CC7 
77GC77A777CCACAAAC7CCA7GAAAA77GTACA77G7CCAAGCAACA7 
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77C77AA7A77C7CT7V . TCTC7CA7A7CCA7T77CC77AC7GC7GI7C . J 

CACC7A7C7C7TCCAAACTCCC7G77AAAA7CCC7GCCCCAGCGAAC7T7 

TATTCAATTTTGTGGAATGGAGGCTGCACTGATTTAAATTAAAAAAAAAA 

AAAAAATCCCTACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTG 

7777CC7GAGACAGGGTC77G7G7C77CCATGC7GGAG7GCAG7GGCATG 

A7CA7GGC7CAC7GCAGCCTCAACC7CCTGGGC7CAAG7AA77C7CTTGC 

CTCAGCCTCCCCAGTAGCTGGGAGTTCAGGTATG7GCTACCATGCCTAGC 

7AA7777777C777TAT7TTG7AGAGACACGGTC77GCCAGG77GCCCAG 

GC7GG7C7AGAACCCCTGGGCGGACG7GA7CCGCCTGCC7CGGCC7CCCA 

AAGTGCTGGGATTACAGGCGTGAGCCACTGCTCCCGGCCTTGGGTGCAAA 

TTTGAGCTTTCTCA CTTATT AGTGTAAGACATACAGCTAATTTCTAAATC 

77CCAAACC7CAGATTTTTCATCCATGAAGTGAGGA7TATTA7AGAGC7C 

ACTAATAACATGGCTTCAAAAATATATAATGCCAAAATTGAGATCAAAAT 

AATAAATC7ATATTACATGGGAGATCTTAATGTACCTCTTATATTATTGA 

7AGAC7AAGA7GATCAAAAAAA7AGAAAGAGAGCAGTAAGGAGAGCAAGC 

ATT7AA7CAATAGGACCAATACA77TTAATCAATAGGA7CCXCAGGAATA 

7ATACAGAA7ACCAAACC7AACAAC7GCAGAAAACATGCCAAACATTTAG 

GTACAGACA7TGTTGGAAAATC-CAATCTTGAAACGAG7GGAC7GACATTC 

AGAAGA7A77AA7AAGAGCACTAATGATGGGGAT7GCAACCATGTCTT7A 

C7GAC77CCAGAAGCTTCTTACAGTAAACA7GAAATCACATAATT7 C7TC 

CAC777CC7AC7GTTTCT7GTTC7GGGCTC7GTCCTGC77AC7GTCTAAT 

ATC77GGCCCCTTAAAAGTTGCTAA7CTTCCAAACCTCA77CC7G7GAC7 

GGG C CGC7GG7C C7TGTTCATGGGC C77GAAAATAC7GAC7G7ACAC77A 

7C7GGAGCA7CCAGTGCCTACCACC7GACCCAGA77CC7CA77GCGC7CC 

7CCC7CC7CCACCTATTGGAATT7GC7CA7ACCCG7GTGAGACCCC7CCC 

777CCCCCCA7C7GAATTTTTATCAAGACAACGCACTGCCA7ACTCCCTC 

G7ACCC7GC7CTGGGCATCAGAC7GAA7G777G777CCA77GAGGATCTG 

CAGC7GCATCAG7T7CCCCAGCACCGTCCAACCCCTTGAGCA7GGCTAGT 

CCTAAAGCAGAGAATTAGCCTTTCTATCCCTGCTGCTATACATGCTGGGA 

CAAATAATAAGAAATGACAGCATTTTATGATAATGCAGGCTGCAGGAGGC 

AGGAGGCAGGAATCAAATTCGTGCTTATCAAATAGTGCTCCAATTCTTTG 

AATATTGGAC7ATAGAATATGTCATGGATCTATGCTCAGGTGGGTTCCCT 

ATTACTCACTCCACTGAGGCCAGGT7GTGGGATTAGCTGTCCAAGAGGGA 

GTTTCAGTCTCACAGCATAGGGTCATTCTGAGAATTAC7GGCCCACACTT 

GTG7GGAGACCTCCAGAGAACAGAATC7GGGTTGGTGCCATGTACTTCCA 

GGAGGAGAGAAGTGGCAGGATGCCCAGCCCCACAA7CAGAGGGGAAGGGG 

CAGAGCCACA7G7ATGAAGATCC7C7CCCCAG7ACG7GCCAATCACAGGG 

C77CC7AGC7777GGGCCAAGGAAACAA7G7GGGAAGCAAAAAAGGACAA 

7777C7CC7CCC77TGCA7GAAGAC7GAGCAG777TACCAGA77CCCAGG 

GAAACACCC77CCACTCTGGGT7GAA7GTGAG7GAGAGACATTCAGC7GG 

AACAC7AGAAAAACTA7TTCCTGAGCGACTCACCTTTAGCCCTAGAAAG7 

G77GGA77TG7CCTTCATCTTTGCCACAGTAGAGACTGCTGATAGCATCA 

GAAC77GGGC7CTGGAATTAGACAGATATGGGTACAAATCTGAGCTCTCT 

CAC77ATTAGTGTGGGATGTAGAGCAACTTTTAAAATCCTTCCAAACCTC 

AGAC77CTCATGCATGATGTGAGGATTGTAATAGGGCCCACCTAATAGGG 

G77777GAGAA77AAAAAAGTTA7TCAA7GAACAGCA7T7AGCAAGA7GC 

C7GACCATTGAGAAAATAACAAATTGTTTArrATTATTGTTATTATTAAA 

CATC77TCCTGCACCTTCTGACTGGGGGCATCG7ATCATCAGAAATAC7T 

AGGA7GGGATGGATTCCTGCATGGGCTGAGTCAAGGGTGCAATAATGGAG 

GAGTGAAGAAGGAAGAAATGGAGGCAGAAATCCCCAGGAGCCCAGCATGG 

7ACAAGGC7GAGCTAGTGCTGCAGAGCCTCCTTGGAACAGCCACAGAGCT 

7GCA7C7GGCCCTGGGAGGAACC7C7TCTAGC7GGCAGGACCAGCCACAA 

CAG7GGCCAGGGGATTTCCCAGGGCGTGGGCTCCTAGGAG7TCA7TTGGA 

CCAAGCC7GCCTGGAGAGGGGTTATAACAGGGATCCTTCCCTACTGGCAG 

GTGA777ACCCCTCGGTGAGAAGCTCAGGCATTTGTTTGATGGAAGGTGG 

AAGGCCC7G7GC7GGGCCAGTGAC7ATCAGGGA7GGGCGGGTGGCTGGAA 

AA7AGCAAATAAGACAATATGA7AACACAG7TAACCACCACACTATGTGA 

AGC7ACAATATGGGTATCTGTAA7AGACAATTCCAATG7AGAGAATAAT7 

C7AAGG7G7CATTCTCCCCGCCAA7GCCATAAGCACACGGCCTCTGCC7G 

GG777C7CAC7G7GGAATGTCC7CC7GG7C7CC7CA7GCCCAGAGAGTGG 
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GAAGTACTCCTACT7T. . .CACCGGC77TCC7G7CA7C7CCC7GCAGCC 

CC7CAGCCCCC7C7GCACAGGGAGG777CC7CCC7GC7GC7GCAG7GC7T 

7G7AC77G77AG7GG7ACC7GCACACAGG7A77GG7G7CC77G7C7CACC 

ACCC7ACA7CAC7G7AAGCTCCCCAGGAGCAGGC7TCC7G7TTGACTCAC 

C7G7GA7CC7CCACC7CCCACCC7G7AG7GCC7CAAGCA77GAGGACAA7 

CAC7GGC7GCCCC77AACCCAGAAA7GC7GCCGAGACAGGAGGCCA7GGC 

CCAAG77CC7GGAA7GGGG7ATTAC7A7G7CAGCACAAAGGCCT77GCAC 

AAA7GAAGGC777AAAAA7GCAG7CC7AG7CAGG7GGAGGAGGGC77A7A 

GGA77CCCAGGAA7C7GGA7CA77C7C77GAGAGC777CCC77G7C7C7G 

TTAAAACTCACATCCTACGGCCCAAATAACAACAAAAAATGGATGTAAAT 

TC77GAAA7AAC77G7GGA7GGGGGAACAAGGCCCACCCCCCAGA7C7GC 

CAGAAGCTTCAGGTGAGGGTCCCAAATGCCAAAAAGTCTGGTATCAGAGA 

GGATGGCCAGTGACCTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGA 

GCAGCAGCC7CGGCCCCGCACAGTGACCAGGACCC7GGC77CCCACGCTG 

GGCAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCT 

GTCTCCTCGTGTCAGCTTACCTGGC7TTGCTGCGAAGAGGCCACTCGCAT 

77C7CAA777777A7A7T77T7TAA777777AAA7777TrA^TTrATTT7 

TATTTTTATTTATTTATTTATTTTTAATTTTTTT ll ' A ATTTTTTAAATTA 

TGCTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATAC 

GCATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGC 

ATTAGGTATATCTCCCAGTGCTATCCCTCCCCCCTCCCCCCACCCCACAA 

CAG7CCCCAGAA7G7GATG77CCCC77CC7G7G7CCA7G7GA7C7CAT7G 

AA777C777AAAGG7GGAA7C7C7CAG7GGGG7C7AA7C7G77CAGAAA7 

A7CAAAAGAG7A7CC77GGGAA7GAC7GGAA77CCAGAG7CA7C7GGTAA 

TCC7CA7AAAACAAC7CC7GGA7G7C7C7CAGCACA7C7CCCACC7TGAA 

CGCAGGAGGCTGGTTCAAATGGAGGAGCATCGCTCTACTGCACTTTTTTT 

TTTTTTTGGCCTAAAGTGCAAAAGGGGATACGTTTCATGTAAATAAATCA 

ACTGCAAATCGCTAGTTATGCTGAGCCCTGTCCCGTGCTGTGGACACAAA 

GGAACCAAAGGCTTTTCTCCCCGCCCAACACACACATAACACACACACAA 

AATCATAAAAACATACATACCCCCAACACATAACAACACACAACACACAC 

ACAAAATATATACACACAACACACACCAAACATGCCCACAAACCTGTGTC 

CAGAGATAGATCCTACTGGTGGGTTTGTGGTCTCGCTGACTTCAAGAATG 

AAGCCGTGGACCTTCGCAGTGAGTGTTACAGCTCTTAAAGATGGCATGGA 

TCCAAAGAGTGAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAA 

GCTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGC 

CAGCTTTTACTTCCTTTTGGCCCCTCCCATGTTCTGTTTCCATCCTATCA 

GAG7GCCC777777CAATCCTCCC7G7GA7TGGC7AC7777AGAA7CCTG 

CTGATTGGTGCATTTTACAGAGTGCTGATTGGTGCGTTTTACAATCCCCT 

TGTAAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAG 

ACAGAAAAG77CCCCAAG7CCCCAC7GGACCCAGGAAG7CCACG7GGCC7 

CACC777CAAC7CCA7AATGGCATGAAAA7ACA7A7G77G7ACAAAACA7 

ACATACACAAAGTATACATGCATCTCCCCAAATATACACATACCACAGAA 

ACATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGC 

CTCTGCAGTGAGTAGTTAGAAAAGTGAATTTGTTT T TCAATAAATTGGAG 

TCCTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAA 

AATATGTATGTCCTTTG G TCT A GCATTTACACATGTAGGAATTTATCCTA 

GTGGAGTAATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCAC 

AGAATTATGTATGCATATGTGTGTGTATATATATATATATCTCATACATA 

TAATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGT 

TAGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCTTATTC 

TCTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGG 

TACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGA 

GTTTATGAATACTCCATACTACACTAGGTAGCACCCCCTATTAAAGACAA 

ACTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTC 

TACAAGTCTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCC 

ATCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCT 

CATTCATTCCAGAAGTGAGCACCACAGAAGTGCCTACAGTTACCCCAACC 

ACCTTCTTAGAAGATAAGT7AGTGTTTGTTTTGA C TTTTTAAAATTTTT A 

C77CC7C7777CC77CACAA7C7CA7CCCA7CCCAAGAGG777ATCAAGA 

AG77C7C7AAAGA7A7G7G7CTCC77A7GGAA777AACAGAAA7CAGGGA 
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. , . o x a * * - -ASjUwA'* v iGGGAATAACATTTTTCCAGGT C" 7« ACiAC 

A7AA7GGAA7ACC77GCAG7AA77AGA7ACAC7A77G7AGAAAAG7AT7G 

A7GAAA7GGAACGA7G777GAGA7A7CA7A77GAG7AGAAAAGGCAAGA7 

ACA77AAG7AGGAAA7G7A7C77ACAAAA7AA777G7CAGACACAC7CC7 

A7A777G7A7G77A7A7AAA7GCG7A7G7GAAGAAAGGC7AGAGGA7GAG 

ACCACAG7C77CG G7GAAG 7TTAAGAGA7GA7GC7GCAGCA7GC7CAGAA 

AGGCTTGGTATAG7TTTTTCCAGTAATTAAGGACTGATCTTAGGTAAATT 

GTCCATCCTCTCTAAAGTGCACCACCTTTTG7CTGTAAAACAGGAAGGAT 

SG7A777ACCCCCAGGG7CA7CAAAGGA777GG77GGAGAAAAA7AAA7A 

AATGGGC7GAGCCCAGACCTGGCACAGTGAGAGCACAGTGGTTGACTATT 

G7GC7GGCC7G77G77CC7G7G77A77GACATGC7GC7GG7GG7GG7CCA 

GAAGC7A77ACC77AA77GG77A7G7GGATrrCCCC7CA7AC7GAGCAGC 

TG7G7G7GG7G77G7AAAACA7AGCCA7ACACAG7AACTGACAAGGGCAA 

A7G7GA7GGAAAAA7GCAAGGAAG7GCAGA7AAA7AGC7AA7GGGC7G7A 

GAAGGAAGC7AG7CC77GGAGGGC77GATCAAGGAAGG7CC7777GCA7G 

7CACC777GAAGAAGAGGGGACA7AGAAGAGG7A7AG7GCATCCCGGAG7 

G7ACC7GGAAGGGAACA7GAAAAGAGGACA77777C7C7GGCACA7GGGG 

AC7C C AC77GCA7GAAC7C7GGAA77GGGGCAAAGAACCA7CA7GAGAAC 

AAGGGC77CC77GAACC7CCCAGGC7CA77GGCTGA7C7AAACCC7G7G7 

CCCC7C777CC77CAC7CTCC7C7G7TT7CTA7ACC7G7A77A7TGGACT 

GGAC7GGAAGCCACC7GA7C7A7CACAAG7ACC77GAAA7G7G77GAA7A 

GG7G7GGCACAG7CC77AGCAGAG7GGCAC7ACCCCCACAGGAA777G77 

TA7ACC777GGCA7GGAAAA7AGCAGGAAA7GAG7GA7CAC7GA7AAC7G 

AGGA7GC7A777A77A77GGCCAAAGGAA7AC77G7G77G7A777GCA7A 

ACCAC7CACAAAC7G77GA77ACAAA7GAG7ACCAGACC7AGC7CC77CA 

AG7AAAGGA7C77GAGAAC7GAAGGCAAACAGAGC7CCAGGAG7CCAAGA 

CAGAGCCACAGACCACGAGGA7CCC7GGCCCAGG7AGGTGG7CC7CC7GC 

AC7GGC777CAAGGCCAACAGGA7GGA7GGGGAAG7AGAG7AGCA7C7GG 

CCA7C7AGACCC77GC77TT7ATCCCCAC7GGAAGCACA7C7GAA777CT 

AAATA7GA7C7CTGAGACCTGCCCAGAACACC7TGCTC7CAGCCCCAGTA 

GCAGCC7GCTC7C7CCCAGGAGGGC77CCACTAACAAGTAGGGCAT7GCT 

GGAGGGCCAGGCAGACAC7AGCTTAGGAAA7CCACCAACCC7GGAAATGC 

7AG7CCC77C7C7GAAGGC7CAGAAGAC7GAC7T7AGAG7C7AGAAAA7A 

T7GG7CC77GGGAACAGATTTTGAG7GCAAAGAGATGGACT7CAGATGGC 

CAGA7GCAC7GCTTCTT7AGGGAAT7CTGTGAAAGC7CCC7GCA777ATC 

77AA7ACAGGCAGCAGA7TTCATGAG7ACCCCCGAGGGATGGCCCCAGG7 

CC7CCAGCC7G7GAGCA7CCTTC7G7CC77CAGCAGCACCACAG7A7C77 

7A7A7G7C777GGA7ACC7ACG777C7GCCAGACATC7C77GC7C7GA7G 

77C7GGC7GCCAAA77C7C7G7CAAGCGCC7CCAA777777G7G7CC777 

GA777 AC C C CAACA7GACAAAGGCAG77G7GC77CATG7A77CAGGGA7A 

C7GCCAAACCACAAACAGGTTAAAA7CAAATAGCAGA7A7CCC7G77CC7 

AAAGACCCA7CAGC7C7ACCCACCTGCTCCTGC7CACCG7CC77A77G77 

GAG7CC7GAAGCCC77CCTTGTCATTTT7ATTTT7TGCATGAACAA7TTA 

G77CCC777G7C7CAC7CCTAAACCTTTCTCAAAGGATTGGATTTGTACA 

CAAAC7GCCTA7CTCTGCAATCTTAGAAGTGATATGATTCTGAACAAATC 

ACT7AA C T T T r G AIUlU"riA TTGGTAAGATCGGAATACCA A ' rr ' r ' rTU CTC 

CAC77C7G7CC7ATG7TGGCCTGGGCTGATG7TGAAAGCTCTCGGTCAAC 

7GAGA7AGGG7G7GCAGAATTTA7ATATATAAA7A7A7C7CCTCCAACCC 

C7CCCAA7GAAGCAAGTCACGTGAGTCAA7CCTACCCTAAGATATTAGGG 

ATTGAGCC7CCTGGGACATTTGG7GGCTTAGGTTT7CATGAAAAGAGGTT 

GCAGAGCAAC7GCTTTTTGTTAGGCAAAGAT7AGGCTAC7GCAGAGACTC 

AGCAAACT7C7A7AGAAGGTG7CAGA7GG7AAG7ATTr7AGGCTTTGCT7 

GCCAGA7GA7C7C7CAAC7AG7TAACCA7GCTAT7GTAGCCTCGAAGCAG 

CCAGAGACGA7C7G7AAACAAGAGCATGTAGTG77GGCA7AAA7A7AG7A 

CCGCG 

>Concig43 

GCAA7AAG7C7A777AC7G7AAAGT7AA7CAAA77TACA777CAGAACAC 
77AA7C7GCAAGAG7CC7T7CCAAGACCC7ATACC7AATT77G7G7TTAC 
AA7777A7A777G I ' l ' 1 1C1 r AAAGAAGACCACCAA7A7AAAC7A7ATCCA 
3 C C77 ZA7GA7AAG7ACA7AAGAAAC7A7GCAAA7AAGGGGGAAAAAAAA 
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CAAAGAAAAA7ACC7AL rTACTAATGGTTCACTTCTGAATAGCSCX?^ 

TCATAATGATACAAGCACTCATTACTAGTCTAGGAAAATGAAGATATAAT 

TGCA77AGGAAGA7CAAGAGG7AGGAAA7G7GGA7G7G7G7GG7A7AGAC 

TAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAGATAATTGTTAA 

T ACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCGGT CATTAAGAA 

ACAAGCACTA AACACCA GATAGGAAGCGAGAGATGGGGGAAGAGGGCAAG 

AAT CTG ATTATTTTTTGCAACAAATTTTGTAAAAC CATTTGACTG7TT AC 

ATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAATAATACTATTAT 

TTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCATTTTAGTAATT 

AAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTCGAAAATTAAAA 

CTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGGAGTTCGAGACC 

AGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAG 

CTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAG 

GAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCACA 

CCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAA 

AAAAAAAAAAAAAA7TAAAACC7C7GGAAG77GAG777GCAQA7A77CA7 

TATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATGATGAGAATTGA 

GGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCACAGCCCATTCAA 

TTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGGTCCATCCTTCA 

CTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCACCTCTAAGTTG 

GAGCCCAAGAAGAGGCACTGGGAAATGGAAAAG7CTTTTGTACGXGGTAC 

TTGCCGGGGAAGCTGCCATGAAGACCTGGCCCCACGGTGGGGAGGGAATG 

CCCAGCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGTCCAGACATGTC 

AGGTGGTCTGACAGGGCAAGCAGCAGGAAGTCATGTATGAGTATGAACTG 

ATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGGGGCGTGAGAAA 

ACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAGCCATGGGAGTC 

ACCAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGAGATGTGAGTG7 

AAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGCCCTCTTTCCCT 

ACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACATGGTTCCTCA 

GAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCCTGGTCAGCAG 

CAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGCAATGAGCTCT 

CAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCAAGACCGATTT 

TCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGGCTGAGACTAG 

CTGGCAGCTCCAAGTTCTTGGTCAGAAGAGAACAGGAACTAGGGAAATTG 

GAATTACTGTTACTACAATTCCTTTACATCCGCACAACCATGAGGTCCAG 

AGAGTCTCTCTTATTTTTTTTTTAAAGACAGGGTCTCACTCTGTCGCCCA 

GCCTAGAG7GCACTGGTGTGATCATGGTTCAGTACAGTCTTCACCTCCCA 

oGCTCAAGTGACCCTCCTG CCTCAGCC T CTCAAGTGGCT GGGACAG CAGT 

TGCATGCTACCAGGCCTGCCTTTTTTTTi'TTlTTl'l-l-il-il-ill 1 !"! i'ili 1 

TCGG7AGAGAC7GGG7C7CTCTG7A77GCCCAGGC7AG7C7CGAAC7CC7 

GGGCTCAAGTGATCCTCTGGCCTCAGCCTCCCAAAGTGTTGGAATTACAG 

GCATGAGACACTGCACCCAGCCAGTATAGTCTTTTAACAGCTTTATTGAG 

GTACGGCTAACATTGAAAAAACTACACAAATGTAAAGTATGCAATTTGAT 

AATTTTGACAAATGTACACACCAGTGAAACTATCACTACAGTCAAAATAA 

TGAACATATCCATCACTCCCAATTTCCTCACGCCCCTTGGTAACCCCTCT 

CTCCCAACTCCCTGCCCCCTAACATCAGAatACTACTGATGCATTCTGTC 

TCCATAGGCTCATTTACATTTTCTAGAATTTTACATAAATAAAATGACAG 

AGTATATACTCCTTCATGTATGGCTTCTTTCAGCC CAATT ATGTCAAGAT 

TCATGCTTATGGCTGTGCGTATCCTTAGCCCATCTCTTTGTCTTGCTGAG 

TAGGATACCATTGCATAGACAGACCACAGCTTGCTCATCCATTCACTCTT 

GACAACG77GAATTGTCTCTGTT7T77GCAA7GACAAATAAGGTTGCTAT 

GTACATTCCTGTATAGACATTTGTAAAAGCACAGCATTTCATTTCTCTTG 

GGTAAAGACCTAAAAGTGGAAAGGCTGAGTCATATGGTAAATATATATGT 

CTAACTTTTTAAGAAACTGTCAAACTGTTACCCAAAGGGATTGTACAATT 

TTACATCCCCACCAGCAGTGTATGAAAATTCCCG7ACTTCCACATCCTCA 

CCAATATATGGTGTGGTCAATCTTTTTAATTTTGGACATGNTAATGAGTG 

CAAAATGAGGCCCAGAGTGTCTGAAGTTACATTTGTATCCTTTTTGGCAT 

CCAAAACAGGTGTCAAGCATAGAAAAAACACTTGTTCCTTGAATGGTCAG 

TCATT7ACAAGTGGAATTCATTACAAACCGGTAGTTCTACTGGGTTAAAC 

7A7GCC77AC7G7CAACAGGCACA7ACACA7ACAGACAGACAGGAAGGCA 
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ZAGAGACAAGGCAGAGC . . 7GATAAGAAGG7GACCTGGGC7CTX<3C£C1 _ 

GCC7A7CACC7AGTAAAA7A77AG77AAG7AGCCA7GAG7AAC7CAC77A 

ACTTACCACAGGCTCCATTTTCTTATCTGTAAAATAGGAACATTGAAACA 

G CTAATCC C CAAGGTTTGTGGATAATCAGAATTACAAAGATCAATGACAT 

T7C7A7GAGAGAAACATA777CCAAG7A777GA7GGAG7ACA7CAGACAC 

AAAGGAAAGGAAACTGAATATTr rT GAGGTT TT ' rrrrm ' T ACCAAGAAA 

77CACA7777G77AAA7777CAGAAC7ACC7CC7GAGGAAAG7G7AGC7G 

3ACC CA777AGAA7GA7AGAAAACA7CAA7C7G7C7GA77CCAAAGCCAA 

GTTCTTGCTACAACGAGAAATGAAACAACTGGATCCCTACAGATGCAGAG 

ACCTGGGCCCCACAAATGTGAATTCTGTTCCCCTACCGAATAGAGTTACA 

377C CATAATACAGTACTCCCTCA C TTTTCCACAGTCTCACATTCCACAG 

7TTCAGTTACCCACAGTCAACTGCAATCCAAAAATATTAATGAAAAATTC 

CAAAAATAAACAATTCAGAAGTTTTAAATTGTGCTCCATTCTGAGTAGCG 

TGATAAAATCTTGTGCCACCATCCCACCTG7CCAGCTTATCGTTAGTCAT 

TGACATCGTCTGCTCCTGACATCCAACCATTGACATCATCATGACTCTAT 

GATCCAGGATCACCGAAGCAGATGACCCTCCTTCTGACATATCATCAGGC 

CAATATCAGCCTAAACACTGCATCACTATGCCCACATCAGTCACCTCACT 

TCATCTCATCAAGGAGGCAATGGATCACCTCACATCATCACAAGAAGAAG 

AGTGGGTATAGAACAATAAGATAATTTTGGGGCAGGCATGGTGGCTCACG 

CTTGTAATCCCAATACTTTGGGAGGCCAAGGCAGGAGGATCCCTTGGGCC 

CAGGCATTCAAAACCAGCCTGGGAAACATAGTGAGACCTCCTCTCTCTGC 

AAAAAAAAATAAACAAAATTATCCAGATACAGTGGTGCATGCCTGTGGTC 

CCAGC7AC7CAGGAGGC7AAAG7GGGAGGA7CAC77GG7CCCAGGAGG7C 

3AGGCAGCAGTAAGCTGTGATCGTGCCACTGCACTCCAGCCTGGGCAATA 

AAG7GAGACC C7GTCT CAAAAAAAAAAGG7AA77T7GAGAAAGAGACCAC 

ATTCATACAACTTTTATTATAGTATATTG7TAGAATTGTTCTATTTCATT 

ACTTATTGTTGTTAATTT ClTrcm 'GCCTAATTTTTTTTTTTTTTTTTG 

AGTCGGAGTTTCACTCTTGTTGCCCAGGCTGTAGTGCAATGAGACGATCT 

CAGCTCACCGCAAATCCCGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCA 

GCCTCCCGAGTAGCTGGGATTACAGGCGCCTGCCACCATGCCCAGCTAAT 

7TTG7A7777TAGTAGAGGCGGGG777CTCCATGTTGGTCAGGC7GGTCT 

CGAACTCCTGACCTCAGGTGAGGCCTCAGCCTCCTAAAGTGCTGGGATTA 

CAGGCTTGAGCCACTGCGCCTGGCCTCTTTGCCTAATTTATAAATTAAAC 

ATTGTCACAGGCATGTATTAATTTATAGGAAAATCATAGACATATAGAGT 

TGGGTACTATCCACAGTTTCAGGCATTCACTGAGGGGCTTGGAACACGCC 

CTCCTCAGATGAGGGGGGACTACTGTCATCTCCTCAATCATTCTTGATTC 

AATCCTCAACACAAATGGTTTGGCCAGGTCTTGCCTCTGGAGACAAAATT 

GCTAAGGATTTAGAGGGGAAAAAATGTAGTTCACTGGGAAAGTCACCTCT 

3C7CCAC7GGACAGCAAC77AAAACCCAGGCCA7GACAAG7AGAAAGGCC 

ACCCCCACTCTCCTTCACACC'TGGAGTATTCAGGAGTCAATCATATTTCA 

GGACCACCAGGAGCAAACTGGGAAAAACTGAGCTGCCTTGAGGAAAGCAA 

TCAGCTCCACAAGGGGCTTAAGAAACAAGCTCTGGGAGGAGTGGTTGGAG 

AAGAGTTGGGGACACATCAGAAATGCCATCAAATTTCTAAGGGCTACCTC 

GTGGTGTCAGACCTGTGCATCTTCAAGGACATAAACAGATGGGATAAGCA 

GATGAGATTCACAGAGGACATCAAAATATTGGCTCCCCAGAAGGGAGAAC 

ATTCTAGTAACAGAGCTGCCCAGCTGCAGAGTGGACTGTTTCACAAAGCA 

ACAGGTGCCCTGCCTCTTGAATCACCATCTTCACAGGAATGCAGTAGAAG 

GGACTTAACTCCTGCCCTGAAGAAAAGGTTAGGCTAGGGAAACAGCTCCA 

AAATTTTTTAAAAGGAAGCAACATAGGCATCTACTGGGAGTTTTCTAAAG 

CCTITGTTTAATGAAACTAAAGAGCTGGGACAGGAAATGCCAAATTAAAT 

TAATAGAGCCTTGCTTTAAGACAATGCAAGTGGATGGTAATGAAGGAATG 

AGTCTTAGGCCTTGGATCAACCGTATTAAGCAATGCTGAGCATGGAGCCA 

ATTCTGTTCACTAGATTTGCTCAGAAAGGGCCAGACGAGAAGGATTTTTC 

TAAAGGCACCTACTACCAAAAAGCTGCCAAGGCGTCCAATGGAGCCCAGA 

GAGAATATGCTAACAATAAAAAGTTGAACACCCTCAATAAAAAAGGGTAA 

AAGTAATTAATAGAAAATTACTGAAAG L1 IT ' l ' l 1 GAAACCAAAAGTAGTC 

AGCATTGGTAAAAGTCTACAAAAGTGGACACTTTCATATAATGTTGGCAG 

GAGGGTAAAAAGACATAACCTTTTTGGAGGACAATTTGGCAACAGAGTAC 

CAAAAACCTTACAATTGAAGAGAACTTTGGCCTGAGTGCAGTGGCTCACA 

C CTG7AATGCCAACACTTTGGAAGGCCAAGGTGGGAGGATTGCTTGAGCC 
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CAAAAG777GAGACCAGL07GGGG7AACACAG7AAGACC7CG7C7C7A7^ ' 

AAAAA7AAGAAAAG77AGC7GGGCA7GG7GGCA7G7GCC7G7GG7CCCAA 

C7AC77GAGAGAC7GAGGCAGGAGGA7CGC77GAGCC7CGGAGG7CAAGG 

CTGCTGTGAGCCATGTTCATGCGACTGTTCTCCAGTCTGGG7GACAGAAT 

GAGACCCTGTCTCACCAGAAAAACAAGGCAAGAGAGAGAGAGAGAGAGAA 

GGAGAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGATGGAAGGAAGGAAA 

GAGAAGAAAGAAAGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAGAAAGAAAGAGAAAGAAAGGGAGAGAAAGAAGGA 

AGGAAGGAAAGAAAGAAAGCAAGCAAGCAGGAAAGGAAGGAAGGAAGGAA 

GGAAGGAAGGAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA 

GAAAGAAAGAAAGAAAGAGAAAGAAAGAAAAAGGGAGAGGGAAAGGGAAA 

AGAAAAGGACAAAGAAAAGACCTTTGAACCCTGAATTTCACTTTTAGAGA 

TTCATCTTAAGGAAATTCATTCCAATAGAAATTTATCCCCAGGATTATCT 

AAATATTTGCTTTTATTTTCTTCTAGTAATTTTATGGTTTAACTTTCTCA 

TG777AAGCC777^777A7T7GGAA777A7777GG7A7GAGAAAG7G7G 

ACC7777777G7777AC777AAAAAAAA7G7A77ACGA77ATXA7777AG 

AGACAGGG7C77GC7CTGTCACCCAGGC7AGAG7GCAG7GG7G7GA7CA7 

AGCTCACTGCAGCCTTGAACTCCTGGCCTCAAGCAATTCTCCCTCTTCAA 

CTTAGGAGTAGCTGGGACCACAGGCATGTACCACCATGCCCAACTAATTT 

TTTTTATTTTTTGTAGAGACAGAGTCTTGCTTGTTGCCCAGTCTTGCAAT 

G77G7C7CAAAC7CC7GGGC7CAAG7GA7CC7G7CGCCCCAGCCTCCCAA 

AGCAC7GGGA77ACACG7G7GAGCCAC7GCGCCCAGC7GCC77777A777 

T77AA77777CAGA7GC777G77GG77CCAAAA7AGCAC77A77AACCCA 

CGC777CCCCC7C7GG7777AAA7AC7GCAAG777GGC777GAAA7ACAA 

CCCACTGCCTTATTCAGGCTACATTCAAGGAAATCTGAGACCAAGAG7CT 

GAAGGCCCAGTTTCCTTCCTCAAACCCAGGAGGTGGTAAATGTGTCACTT 

CCACACTTTCTATCTATTTCTAAGAACTCCTTCTTrCCAAACTCTGACAT 

GCCCCTGGCTCAGGTCTATAGAAATTCCCAGGGTCCACAGACAAAGCAGA 

ACTCACTTATGGGGAAATCTGGGAAATACTTATCTGTTAAACCTGCCCCA 

TATGGTGACTCAGATTGTCTAAAGCCCAAAGCATCATTTTCCACCCCAAA 

CCATTTCCTCCTCCAGACTTCTCTATTTCTGTGGTCCAGAGTCAAGATCT 

TGATATTACCCTAGAGTCCCCCTTCTGCTCTCCTGCATACCCAGATGCCC 

CTCCCTCCCCAGATCCATTCTCCCACCCTCCCTCCCATCAGTTTGGTGGG 

CCC1ATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTTGGAGCA 

GCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTGTTGTGT 

TCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGAAAACAC 

TCTAGTGG7TCCTCATTGCTCATTAATGAAAGTCTAGATATTAAACGTAG 

AAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATCTTTTCT 

CC77ACTCTGACAGAC7C7CCG7C7G7CA777A7G7A77C77 7 7 A 77GC7 

CTC77C7AC7777AG7A7GAAC7GGA777A7GGA7777777AACA77GC7 

77CAAG7A7GGAA7AAAGAAT7T7A777A777A777A777A777ATrTGA 

GAC7GGG7C7CAC7C7G77GCCCAGGCCAGAA7GCAA7GG7GCAG7CA7A 

7C7CAC7G7AACC7CGAAT7CC7AGGC7CAAGCCA7CC7CC7GCC7CAGC 

C7CC7AAG7AGC7A7GACTACGGG7G7GCA7CACCACA7C7GGC7AA7GG 

AA7AAAA7A77ACAA7GCCTAATC77AA7T7TCAAAA7777AAA7TACAT 

7G7ACC7AA7GCCCA7GCATTTAC7TT77TCAG7GGG7CAA7AGCCCTCA 

C777GGCAAAGG7CCCAGGCCCAAGG7AAGGCC77A C 77777CCAAAC7C 

A7C77T7GAAAGACA7AAG7GCC7G7AAGTTG7ACCACA77AGG77C7AG 

GAA77777CA7CAAAGACTTTA7CAGAC7A7777CC7C7AAG77GAGAAA 

GAGC7GGGGGCAGAA7A7GGCAC7GAA7GAC7GAAGAGAAGGCACTGAAA 

7CAGGCCAGAGG77GC7GGAAAGAGCAA7GAGGAACACCAGCAGCAATGA 

GGAGCCGG7GA7GA7777GGC77CACAGGGAGG7G7G7ACCACACCGA77 

77A7C7C7ACG7GGA7GAACCACAGC7G7CGGC7CCC77G7C7CCAGGAC 

A7CACAC7C7CCACA77CCC7CCCA7C77CCGGC77C7GC77CCCGGGGC 

CCTCA7C7GCCCCA7CC7GGG7GAACAC7GG7CGG7CAAC7GCrGGGCG7 

ACC77CCCGC7C7GCACACCC7CCC7GGCCAGCCCACCCAC7C7CACGGC 

7CGCAC7GCAGAGGAGCCGCA7C7C7AGC7CCAGCCCA7C7GCC7C77C7 

GAGC7C7AAC77CA7G7AGGCGAC7CC7GCCGG7G77GCC7CACAGGCCC 

A7CA7AC77CAAAGCA77T7CCCC7CAGAACACCA7G7CC7GGC7GC7CC 

C7CCAGAAGA7ACA7C7C7CAAGCACA7CCCCGCGGC7C7CACC7GGA7G 
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ACTGCATTCACCTTCTC JACATTTGCCC7CCTTTGGATGTATATAGA . 

GTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAAACACCTCA 

AAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACAGGTTGCCC 

ATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCCCTTGCCCC 

TCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTCTCTAACAC 

ACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTCCCATTGCC 

TGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCATCTGATTAA 

TOCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTCTCTCTGAC 

TTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAGCACTCTGT 

ATTACTCCTTTCTTAATGACCACCTGCTGTAGACTGAATGTTTGTCTTCC 

TCCAAAATTCATATGTTAAAACCTAGCCCCAAATGTGATAATATTTGGAG 

GAAGGCTCTTTGGGAGGCAGAGCCCTCATGAATGGGATTAGTAGCCTTAT 

AAAAGAGACCCCTGAGGGCTCCCTTGTCCCCTCOVCCGTGTAAGGATGCA 

ACAAGAAAGTATGGTCTATGATCCAAAAAGCAGACCCTTGCCAGGTACCC 

AATATGCTGGCACTTGAACTTCCCAGCCTCCAGAACTGTGAGAAATAAAT 

TTCTATTTTTCATAAGCCACCGAGTCTATGGTATTTTGTTATAGGAGCAC 

AAACAGACTGATGTGCCACCCAACCATGATTATACGTGTAATTTATGGTT 

TCTCTGCTAGTAGGGATGCAGCATGGGGTTAGGAACCACGCTTTTCTTAT 

TTCCCACACAGTCCTTAGCTCTAAGCATGTTCCTGAATCAAAGATCCCCA 

TCTTTTATGAATGAAGGAGTCAGTGAATGAATTAATGAAAGAACTGATAA 

CCCTCAATAATTATTCCAGC CTriT ATACCT ACTAT TAACAAGCTTGCAT 

TCTACTCCAAATTTATTGGGCTTTAACTCTATTTTTGGCCAGCGViblTTT 

GACATTCCCTGAAGTAAATCTATGCTTTCCATCCTAAGTCAAGGAAGGAC 

CTGGACTAGTAGGGCCAAGAAAGGTCTAAATTCCATGGGTGGGAGAGAGA 

GACTAAATCTGAAAGGAAGAATAGATTGAGCAAAGGTGTAGAGATTGGGG 

AAGGCTGGACATTTGGAGAGAAGGAAAAGGAAACTGACACTAAACCAAAC 

AGTCTCACAAACACAATCTCATCCTTCCAAAACTCTGTGAAGTAAGAATT 

ACTATCCCAGGGCCAGGCACAGTGGCCCATGCCTGTAATCCCAGCACTTT 

GGGAGGCCAAGGTGGGTGGATCACCTGAAGTCAGGAGTTCAAGACCAACC 

TGATCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCTGG 

GGATGGTGGTGCACACCTGTAATCCGAGCTACTTGGGAGGCTGAGGGAGG 

AGAATCATTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCAGAGATCGTGC 

CACTGCACTCCAGCCTGGGTGACAGGGAGACTCCGTCTCAAAAAAAAAAA 

AACAAAAAAAAAACCAAAAAAAAAACAAAAAACAAGAATTACTATCCCAG 

TTTTGCAGATGAGGCAATGGAA GCTCT AAAAAGTTAAGTAGGAGAAACAA 

ACATGAAATGTATGTCTTATGCTTTTCCTCATCCTATTTCCTCAGCCTGG 

AATGTCCATTCTCCCTCCACTATGCAAATCTAACTCTTCAAGCTAACACA 

TAGCAATG7CTGAGAAACCGTCCCTGTGTTCACTCTGTTAGCCTCACTTG 

CTCCCTCCCCATCCCTCTGTTTCCTTTCTGTTATAAC^CTTCTCTATTCT 

GCTGGCATCACAGTCATCTCCACCTGCCTTCCTCACAAGTTAAAAGCTTG 

TTAAGGGCAAGTGGTGTTCTTTGCCACCTCATTCCCCAGGGCTTCTAACA 

CAGTGCCTCATGCATGACAGAGTTGTAAAACAGGTTACCAAGCTGGCTTC 

AGGCAGGTTTGCATGGAACTGTGCTTTACAGGAATACCTGCTCCCCCCAG 

GCCCTGGGTCTTCCTCCTGAGTCCAGGCTCAGACTCTCTCATCCTGCTCG 

TTCTCTCTTGGGGAGCCACAGTAACTTTGAGCAACTTTGCATGGGATAGA 

ATGGCCTATTAGGGGCAGCACAAAGACCCCATGGAGGGAAGAGTACAGAA 

AGGGAAAACGATAATCAT All ' Tll ' Tl ' A AGATGTGC AllTrC ' l TAACAAAA 

TGCTCTAGTACTTGTCCAGACTTTCAAACTCAAAAACCTAAGCGTCCTTT 

TCTTGAAGATCATCAAAGGCCCCAGTGGTCCTTCAGGTATGTCAAGCTTT 

CTAGAAAATAAAGGTAAGTCATAATCACTTAACACACATGGCTAAATGGC 

CATTTCCTTCTAATTTATCAGCAACTGTTACATATTTCTATACTAGAAAA 

AATTTATATTTATACTCAGGGTGGTAAGTTAAATTTGCCATCGAAGTAAA 

GCAGAAAGAGCGTAGCATGTATGTATATGTAACTCAACTGTGCATGAGAC 

AAAGATGTCTTGAGGAGAATGAGTCTAAGATGCGCCTGAGCAATAGTACC 

C 
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>Ccncigl 

GCACCCATGTTTCTAAAGGGCATACCAGCCATAATAACAGGATGGGTGAG 

GATATAGACAGCAGATGACAGAGAGGAGAGTGAAAGCTGGGAATCCCAGC 

TAAAGGCATCAGGTTTATGGAATGAGTAGGGGACAATACTGTGTGTGTTT 

ATACACACATGTATATGTGTGTATATGTATACATGTTTATGTATATATAT 

AATTATATGGTACGATTTCTAATTGACAAAATAATCTATCACATTTTACA 

TTATCAGATTTTACATCTATTGTTCTAAATACACTCAGTCATCAGCCCTG 

TGTGTGGGCTCTTACCCATCCCCATGCACACCTCAGCTCAACCACTGATG 

GATGGATCATCTGCCTATCAGAGGTGGCATATTCAGGTGAATCCATGGCC 

ACAGCTGCAGCACTTCCTACCCACGCAGAAAGGCTCCACAAGAGGAGGCA 

CACCCGCTCTGACTGTCCCTAAGCTCCTGACATCTTCACCCCATGAAACT 

GCTGCTCCTGGGTGCTTCCTGCCTTGCCCTGCCCACCCTTGTACTGTTCT 

CACCATTGACACAGCTGGTGCCCGATGCAC 

>Contig2 

NAAAACGAATCGTCACTATTGAAGCCTGTCTCTCANCGGATCGTGACTAA 

GAACCCCCTCCTTGCTTCAAGTTGTCCTGCCTTTCTAGGCAGAGCCACCC 

TACATCTTAAATATATTGATTGATGACTTACGTCTCCCTAAAATATATAA 

AACCAAGCTGTGCTCTTACCAACTTGGGCACATGTGGTCAAGACCTCCTG 

ATGCTCTTGTCATGAGTGGGTGGGTGTTCTCAACCTTGGAAAAATAAACT 

TTCTAAATTAACTGAGACCTGGGTCAGATTTTTGGGGTTCACAGCAACAA 

TTTAAAAAACTCACCATTGACCTGAAATTTTGACCTTATGCTGTTGCTCA 

CACTCCTCCATGAAAATAGACGCCATCCTATGAGTTCCCTCAGCCATGTC 

ATGCCACACTTCCAACATGTGTCCCCATCCACCATCTGTCTTCTTATTGC 

TGCATCCTACCCAGGCCCTGATCTCTGGACCCATTGTTGTATAATTAAGA 

ATTTGGGGCTGGGCATCGTGGCTGTGGCTCACTCCTGTGATCTCAACATT 

TTGGGAAGGTGTATTAGTCAGGATTCCTCCGAAGGATGCAACCCTAGGGA 

TCCTCTCTATGACCCTATGTCTA 

>Concig3 

CGCGCTCAACCGACCGATTTGCGCGAACCTGCCCATGCCCGAGGACAGTG 
TAATCCTAAAACGTCCCCTGAATCATAAGGATATGAGTGCGAAAGTACGG 
TTCCCTCTGTCACCACTTTCTAACAACGCTATGTCCGATCCGTGCACTAA 
CCCCGCCCAAGTCACTGAAACACTGATGGGCGCTTCCTCTACAGGTATCC 
. AGGGCCAATACCACTACTCCCCTCCTCCCTGTCCCCCTTCCACTCTCTAG 
AGGCCGCGGATGCCATCCTCTATTAGCACAACCGAAAACGACGGTGAAAG 
TACCACGAAGCTCACGATCTGATCGGTCGCCCAATGCGGTTACAACGGCT 
GTCATCCCAACCCCCGTCCCATCCTCCATATTGCCCCCCCCTATGAGGAT 
GGCCCTATCATCATGACCTCCAAAATTCTGTCATCTCCCGACGTAATGCC 
3CCCCTCGAACGCCTGACACCATCAAGTCNGTCACCTCCCAAAATACTCC 
TCCTAATCACCAGGCCGAGTATCCCCGGTTCCACAATACCTCCTTGAGAC 
GGGCCGATATCACAOVC 
>Contig4 

MGGAGTTTAGGTCAACTAGTAACAAGTGGGATTTGCGACTCAGGTCTATC 

TAATCCTCAAACCCACGTCCTGGACCCCTACACAGACTGCCCTCCCTCAG 

TCCTCTGTGTGGCCTCAAGAAGGGTCTGGACATTCAAGTTTAAAAATCCA 

TCCAAAGAATCTATGGACCCAGTGGTCTCTGGAGTCAATGTTCTGAGGCT 

CAGAAGGGCCAGGCAGGAGGGAGCCGCCTCTACACAGTCCTGAGCAGAGT 

GGGCTGTGTCCCGGCACAGCAGGGGAGATCATAAACAGAATTCTGCCCTG 

GGCC CTATTTAAGTAGGACCTTTAGGCTGCCGGTGTCATGACCACAGGTC 

CCANGTCTGCACGATTGGCTGTGTGTGGAAAATCTTCACTCCTTGCGGCC 

TTGTCCTTGGCAGAGAGCACCGCTGCTTCCTCTGATGGCCACCAGGGGGA 

GGCGCrCCCCTGGGAACGGTTTGAANGGGAGCCTCACCCCACACGTGCCT 

TCCGTGGTACCCAGCACCAGCTGCTACCCATGGTTACCCACAGGCCCAGC 

TCTGCTCTGAA GAAGGAGGAGTGG TGGCGATCANGCCTTGTCTGCATCCC 

GTGGCTGCCCCTTTCTTTTCTTT 

>Concig5 

GGGAGCTAACCGCTCAiCTGGGATTACAGGTACGCACCACCACGCCTGGCT 
AATTTTGTATTTTTAGTAGAGACGGGGTTTCTCCGTGTTGGTAAGGCTGG 
TCTCGAACTCCCAACCTCAGTTGATCTGCCCGCCTCAGCCTCCCAAAGTG 
CTGGGATAACAGGTGTGAGCTACCATGCCTGGGCTTATATGTTTCTAGTC 
CAAACATTTAGC7ACCTTTTTTTTTTTT7TGAGACGAAGTCTCACTCTGT 
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TGCCCAAGC7GGAGCACAG7GGCACAA7CG7GGC7CGC7GCAGCC7CAAC 

CTCCTCAGGCTCAGGTGATTCTCCCACCTCGGCCTCCCTAGTAGCTGGGA 

C7ACAGG7ACGCACCAC7ACACCC7GC7AA7777777G77TT7GTA7777 

TTGTACAGATGGGGTTTCTTCATGTTACCCANGCTGGTCTTGAACTCCTG 

GGC7CAAGCAA7C7GCC7AC77CAGCC7CCCAAAG7GC7AGGA77ACAAG 

CATAAGCCACCATACCCGGCCTACCTACTTTTAACTTGTGGAATTTTCTA 

T AAGG7CANGGATGCCTGNGGGAACAAAAGTTTCT C CCTTGGT ATATGCA 

AGTAAAATCCACATGCTGCCTCCC 

>ConCig6 

AGGACTGTAGCTGTTGTCTAGTCACCAGGCTGGACTGCTTGGCATGATCT 

CAGCTCACTACAACCTCCACCTCCTGGGTTCAAGGGATTCTCCTGCTTCA 

GCCTTCCAAGTAGCTGGGATTACAGGCATGCACTACCATGCCCGGCTAAT 

TTTGTATTCTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGCTCT 

CAAACTCCTGCCCTCAAGTGATCTGCCTGCCrCGGCCTCCCAAAGTGCTG 

GGATTACAGGCGT GAGC CCCCGGCCCACATGTAAAAGTTTATATCTCTGT 

TGTITCACCTTGTTTTTGACCTAGTCTTTCAGTGATTTGAATCTTGATTC 

AGTCTTTTGTTATTTTAGTGGTACTTCCCAGCTTTGTGTCATCTGTGGAT 

GACATATGAGTCTTGCTTCTTCATGCCAATTTAAGAAGACTGAACGGGAA 

TAGGTCAAAGGCATGGCCATGAGCGATTTCTCTCCAGCTTTTCATGGTGT 

TCAGCTTCAAATCTATTCACATATTGGACCTGCAAGCCATCATCTTATCC 

ACAGGC7ATCATCATAGGTGAATGTAAATTGGGTTTAGGTGGCCAXGCTG 

AACGTGAGATATNTTC 

>Concig7 

AGCATGTTCTC7AAAGGCCTATCAAAGCTGACATCAAAGGGATAAGTTCC 

AGTTACCCAGCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAG 

CATGACCTATTCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAGTCAGGA 

GAACCTGAAATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACT 

ATTGGGGGTTTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAA 

GGTCACTGGCACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGT 

CTGTTTTGGAAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGG 

TCACAAAACTTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTG 

TGGACTGTGGCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAA 

GGCATGTGGGAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGG 

GTTAGAATGGTGAGTGANAAAGACATACTCAAGGTGACACGCCCAGGTAT 

CTGGGTGGATGGTAAGACATTCATGGACTAGAATCGAAGAGGAGGTGGGG 

ATGGACATTCCTTCCGTTTAGAGGGGTTCACCAGGAGGATTTGCCGGAAC 

A7GGAGAGGA77AACCAGGAATCCGGTGCC77777CCAAAC7GGG77GGA 

GGGG 

>Concic8 

GG7GAA7GC777GGCACGC7GTG7AGA7777AGG7GACGGG7GG7GACAA 

7GAG7CCG7G7CGAGCGC7G A T7T777CGGCC77TAGAGCGAGA77TATA 

CAA7AGAAT77GGCATGAGATTGGA7TGC7TTTAGTCAGCCTCT7A7AGC 

CTAAAG7C777GAG7GACTAGATGACATATCATG7AAG7TGCTGATAGGT 

7TCCAGT777CCGC7CCTAGGTC7GCATA7TG7AC77TrCC7CTTACTCG 

AC77AACCAG7ACCAACCCAGC77C7CAACGGA777A7ACCA7GGCACT7 

7AAAGCCAGCATCACTGACAATGAGCGG7G7GGTGT7AC7CGG7AGAATG 

C7CGCAAGG7CGGCTAAAA77GG7CA7GAGC777C777GAACA77GC7C7 

GAAAACGGGAACGCT7TCTCA7AAAGAG7AACAGAACGACCG7G7AGTGC 

GAA7GAAGC7CGCCA7ACCATAAGTCGTTTTTGC7CCCGAATA7CAGACC 

AGTCAACAAGTGTCAA7GGGC7CGTA7TGCCCGAACAGA77AAGC7AGCA 

7GCCAACGGGA7AAACGAG7CGC7C77GG7GGAGGG 

>Contig9 

GGGG7GGGGCGCCTGG7G7T7C7AAAGAGGATC7CC7GCCAGAAATGGTG 
7GC7GACAC7G7TG7CC7CC77GG7G7GGAAC777GG7GGGAAGAAAGGT 
TGGAAAGGGAAA7777GA7CC77GGA777AACCCGAG777G77AC7GATG 
C7CACAAGAC7AGGGGAAGGA7AAAGGCAGG7GAG7CACTC7AGGA7GGC 
7CAN7GAGC7CCACAGAGCTGGAACCACAGGCACCAGGAGGGATTCAGAG 
CAGGCC7CAG7GCACG7CAGC7GAG7GAACCAA7GAGCAGGTGA7GGG7C 
CAGGCAGAGCCC7G7CC7C777AGGCAAAAACCC77GAAACACCG77CCC 
A7CC7AGCC7G7G77CCACCCAAAGC7GGCCAG7C7CCAGGCCC7GCC7G 
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AGC C CCAAGGAAGTGG7ATGGTGAAACAGAAGGGCCATTC CTGTCCAATG 

TGTGAGGAACTTCATTTCAGACTTGTTGGAAGCCCTGATGTTCAAAAACC 

TCAATGATATCATTCATTTTCCCCATCCATTCAATGCCCATCCAATGCCC 

ATCCGTTCAATGCCCCTTCCATTCCTCTTCAGGGAAATGAAAATTGTTCA 

GAAATCCTTTCTCTTTCGAGAAACCAACCAAACCAAAACCGCGAAATTCA 

CTAAACTAGCCAAGACACAATCCTGGGTTATTTTCCTTTTCCCAAACCTC 

CTSTGTTTAAATTAATTCTACCCTGGTTCTCGGCCCTTACTGCGAAGGTG 

AACTCACCTAACCTCTCCCAAACAGAGAAGAAACTTCTCTTGGTAAAATG 

GGTTTTAACACTTGTAAAAAACCCCC 

>Conniglo 

GCTATGGTTCTAAAGGTAATGGACTATGGCGTACACAACGTCTCGCTCAT 

CGTCTGCCAGGAGGCTAAGGTATCCACGGACAATCGCTGAGCAACAGTGT 

CGTTGATCCATCTCTGTACGCACTTGTCAACATGGCAGGAGTACGGGAGC 

TGCGAGAATCCTCTCTGCTGATGTCCCACGGAGCATGCCGTGAGACAACG 

CCACGAACGGCCCTCGGAGANANCTACTCTGCAATGAAGACGTACGATAC 

ACACGTAGGAGTCCTAGCTCACCAGCCGTATCTAGGTATACTGTACTCGC 

GGATACTCACTCGTGCATGCGGCAATAGATCGATACGCAGTCGTCACGCC 

CATGCTCTCAGTGTGTGACCTTCTGGCGGTAGCGTNGTGGGCGCTATTAC 

TGTGCGCAGCAGGCGCNTCGTACATGTGTCGGGTAGCGATGCCAGGAGCT 

GTAACATAGCAAGTCGCCCCCCTACTCCTATCACTATCCCTACGCTGGAC 

CGCACTCGAGATCTGAACGCACGTCTTAACCTGCCAGTACTCGTGAGACC 

TATACTGCGCAAGCCTTGGCTAGGAGATCCTGCAGCGCCGGCAAAGAATC 

AGCTA7GATCCCCTTGCGATTATCGCACACGCACCATAGAGTATGTGCAT 

ATTAACCTCTGAATGTGCTGCAAGCAGACGGTTGCTCAACATATATATGG 

ATGTGGGGAAATCGCCCTGGTCACCGCCACTTGGCGTCAGGAGGCACCAG 

CACGTCTGAGTGTCACGCACGTTACTC 

>Concigll 

GGCCGAATGGTGAATTCATCCGTCGTCTCGAGGGGGTGAAAGACGGGGAG 

TTATGCTGTAATGGCACCGCTCACCCTGGGCTTATGAGCAGACCTAACCC 

TCCCANAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCCAGTAT 

CTGAACTTCTGTGGCCAGGCAGAAAAGGTCCTGTGTTACTCGTCTCCTTT 

ATCATTCA TGTC CATATTCTCCCATTTGCTAACATTTATGTTTCTGCTCC 

ACTGGATTCTTTGGATTTTTCTAGAACATACCCATGCTTTGCATTGCCTT 

GGTCTTTGAATATTTGGTCCACTTTTCCTGCAAAGTCCCCTCTCACCTTA 

TCTTCCTGGTAAACTTCCAGCCAACACCTCTTTACTAACCAGAGAAACAT 

GGTTCAACTGTGCACAGGCTTGCACAGAAACTGTTCTCATATTGTCTTGT 

CATTGTCAATGTGGCAGAGATGCACCTTAGATACCTCTTTGAGAAAGGAC 

TCACTGCCCAGCTGCCTGGCACGTGATGAGCTGATAGCTCCAGCTATAGA 

CTCCTTTAGGGTCAACCTCTGCTTTCCAGTTGAGATCATATCCTTTGCAG 

GGTGGCCTCCCCAGTGATGACTAAGGCAGTGTTACAATGGCCTAGTCATT 

TCCTCCCAATGCTGGACTCCCAATGAACCATCTGCTCCGGAGCTTCCCAC 

TGGGCAGTCAGAGACCTTAGCTAGTCTGCCTCCGAATCAGAAGGCTCTCT 

CTTGCCACTCTGGCC 

>Concigl2 

GCTGTGTC7AAAGATTCACGGCTGTAGTTCCAACTCCCGCCGCCCTCTAC 
TGTGTCCTCTTAATGGCAGTCATTCACCATCTTCCTGTCCCTCCCCTTCA 
TTTCTTGGATGGTGACTGTCACTTTGCTGCAACAGAACCCTGTCCCAATC 
CTTGATGGTTCAATACACACATAGACATTCTTTTTAACAGGGCGGCCTCT 
CAGGTCTTrAATTTTCTTCCCTCCAATAACCTTGTGATGATCCCCCAGCT 
TAGCCACTTACTGCCAGATCATTACCAGTAACTCCAGCCCCTCCTTAATT 
CTAGT7TCTAATATCCTAATCTGTGACCTCACATTCCAACTTCTTCATTC 
TTATCCCCTGAGTCAAAAAATCCTTTGATCCATGCAATCCATTAAGTCAT 
CTACCTTTTCACCATTCTTCGCCCCACTAGGGTTCTCATTCCTTTATTAC 
CCATATGAAATTCCAAGGCCTGTTGGAATCACTCCCTTGCAGCCACTGTC 
AATACTTCTGCCCCTTTTACTTCATCACCCTTATGTGGCAAAACCACAGC 
CCTGGTGGAGTCGATCCTTACCCCTGCTCTGTGCCAACAGCCGCACACGC 
ATGGCTGATGGAGGTTGGAAAAATCCACACATGCAGTGGGCCCTGTATGT 
CCATATACGTATCCAACCTCCAGCCTTGCATATGCCTCAGTGCTGCCTGA 
CAACACATTATATGTTTTCC7TAGT7CCTTCAGTCTCCTGGGTGCCTAGG 
TGAGTATCTCAGACATCC7TCTCTCTCTGCAAAGCTCCAACACCTCCACG 

FIG. 4 (3 of 61) 

filar 


WO 99/06426 PCT/US98/16I02 


TCACATTCAACTGATGACrGTGTCTCCTATGTCACTTAGATCACAGAGGC 

ATACATAAACAAATCCCAGCCACTGCCAGCACTCTGCACATCTGCGAGCA 

TGGCACCCCCAATCTAGGCCTTTCCTGCTGTCACTTGGGGTGAGCTGATT 

ATAC TC G ATCCTAGTCATTTCT ACTTATGCAC 

>Concigl3 

CrTAAGGCCTCCCTCTAACATTTTAATTTAAGATTGAAAAAGCAAAGATT 

ATXCTGTTTTGGCTGCGCCTATAGTAAAGTAACCCCTATGNCAAATTTTG 

ACACCTTATAGTATTTGACAGGGATAAGTATAAAATTGCTTGATTGATAC 

ATCCACACCCAAATGTATGCTGGGAATGATTTTGTTTCACGGCACTCATT 

ACTTAATTTTTAAAACTCTTATTTAAATTTGCAATGTTTTAAATGACCAT 

CACTTAAAGTAGTAATCAACAGAGGTTAGGAGAACATAACAATACTCTTT 

CTCT TAGAAAATACAACAGAAATATAATTTTTTACAGTTTTGCTCCCAAA 

CTTTTCTCTGTAATAACATGCCTTACTCACCTTTACAATAGGTTTGTTGT 

GAGAATCTTGTAATGTAAACCCTGGGTGTTCTGTGAAGCATTTTTAAACT 

TCTAGTTTACACTGACTCTTATTCAAGTGTTTTTAAAAATATATTTAAAA 

AACTGGCCAGGTGCAGTGGCTCACACCTGTAATCCCAGCACTITGGGAGG 

CCAAGGCGGGCAGATCACAAGGTCAGGAGTTTGAGACCAGCCTAGCCAAC 

ATAGTAAAACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGT 

GGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGAAGAATCG 

CTTGAACCCGGGAGGCAGAGGTTGTGGTGAACCAAGTTTGCGCCAATGCA 

CTGCCAGCCTCTGCAGNGACAGCC 

>Contigl4 

GGGGGCGGGCCGAGTGATCCTAAAGCCCGCTCGCTTCACAACAAAGCCTA 
ACAG7CCAATCACTTAATGCTGCATTTATTCCTGGGGAAGCAAGTCTCCT 
TTGCACTTTACACAGTGAGATAATCAGTTTCTCATGTGGACCACTGGGCC 
AGGAGGGCCTGACAAAGGGCAGTCTACATTTCAGACTGGAAACTGCTCCC 
AGAACTATTTCTTTCTAGTTCCCACCTCGGTCTGAGGTGCCTGAGGAGAG 
GGACTCAACAGAGGAAGCAGGAGCATAGCTCAAAGTCTCAGAACATGGAA 
GAGGAAAAG AATC CTCACAAGATTACGTAACTTACAGGCGTGTTGCTGCT 
TCAGTAGAAGTTTCATCTCCCTCAATCCTGTACACTTTTCCATACATTAC 
ATACTCAAACTGGTCAGCCCTATGGAGCAATAGCAGCAAAGTTATTCTTA 
ACAGTAATTAACAATATAAAAGATCCCATTTAAAAATGGTTACTGGTCAG 
CCGGGCGTGGTNNNTCNANCCTNTAACCCCANCACTTTGGAAAGCATGCG 
GGCGATCCCAAGTCTGATATCGAAACATCTGCCTAACATGTGCAACCCCT 
CTCTACAAAATACAAAAAATATCCGGGCTTGTGTTGGCGCCGTTATCTCA 
CT ACC CGGAGCTAAGTAAGAAATGCTTTACCTGGAAGCGATTTTTTTACT 
TATATCCCCTCTCTTCACCGGGCGCGACCAAATTCTTTAGTATAGGAAAG 
TTTATTGTTTTATGCCTTTGTCAAGGCTCTACTGTATCTTTTCTGTCCAC 
TCAC 

>Contigl5 

GGTT CT GAACAACAGCAGGCGATTC CTAGC C CTGTAC C C GGGGCATTGTC 

CAACACTCGACAGGGCTGAATTCGTCCATAACGGTGTGCCCCTCTGGGAT 

ATAGGATGAAATGAATTGATCTGAGTACCTGGGATGTAAAGTTACTAAAA 

CGCCAGCTAGGTTCACGCCCCGATGCTTAAATATGATCGTGGCCTACACC 

TCGTCCAGCAGAAAAAGTACCCTTTCTTCAACACCACCTCACGATCCTCC 

AATTTAGGAGCTATAAAACTCATGACTCTTTATTTACCCCCTGCAGATTC 

TCAATCCAATAGTGTGTGTCTCCCTGTGAACTCACGGATATACCGATTTT 

CCCCACGTCATTTCCACACGTCGCAATCGCTTAGTCATCCCTATGTATGA 

GAATCATGGATGACTATGTTGAAGTCCATCTATAAAGTTCAACCCCCATC 

TCCGTCCCTGATTCCCCCTCCCCAAGATCACCAACGCGACTCGACATATT 

GTTATCGCCCAAGGGACCTCTTGCATCCCCCATATCCACTGGTCACCTCC 

CCTCTTGGCTGGAAGTCACCGGGAAGTTCTCCACATGTTGT 

>Contigl6 

TGCGAGCGATGTTCCTAAACTTTAGCGCCATTGACTCGAGCATGGTCATG 

GCTGTTTCCTG 

>Concigl7 

AGGGTGTTCCTAAAGGATACTACGTTCCCTAAAGTCCAGAGAAAAAAAAA 
AAAGTAACATAATGTGGCTTATTTGGTATAAAAATTTTACAGGAAGCATT 
GTCAAATATGAAATAGTGTTTGGTTTTGTTTGGGCTGTATTTGTATAAAT 
ATGTTATTGGTATGTGTTCCAAAATTATAGGAAACTCCTATAATTCTGAT 
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ATGACTTGGTGTACATTATCAGTAATAATTATAATTGTTATGGTAAATTA 

TTGTGTGCCATGGAGGTAACAAATTTCCTCATCAAGTGTGTCTTTGACTA 

TGGTTGCCCTAAAACTTTTTGCCATTCACAGACAATTGTCTTGCTTTGGT 

CCTCTTTAGAAGGTGGTTTTATAATCAGCTATAAAACTCTAACGGGTGCT 

CTTGAATGCAGGCTTAAGATAGCTTTGGAGACTGTGACATCAGAATAGAG 

GAAAAACTTTCAGTATTCATGGAGTGCTGAAATATTCATGAATATCAAGC 

AAAACAGGAATTAACTTCATAGATGGAACTAAAAGAATGCTGAAGTAATC 

TTTTTGACTTTTTTTCTTAAAATGTTGATCCTTCGTTTTGTTTTTCAGAG 

TCAAGGAAATTTTTCTGTTGAGATATTGACAGCTTTTAACAATTAAGTAT 

ACTCCAGTGAACACAATTTGGAGCATATTTGTGTCTCTCTATATATATTT 

GGAAACAATNTTTGAGTATTCTTAACTTATTGCAATATT 

>Concigl8 

GGTTGTCTGCTATACCAGTAATGGGATTGCTGGGTCAAATGGTATTTCTG 

GTTCCAGATCCTTGAGGAATTGCCACACTGTCTTCCACAATGG7TGAACT 

AACTGACACTCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCACATCC 

TCTCCAGCATCTGTTGTTTCCTGACTTTTTAATAATCGCCATTCTAACTG 

GCATGAGATGGTATCTCATTGTGGTTTCAATTTGCATTTCTCTAATGACC 

AGTGATGATGAGCTTTTTTTCATGTTTGTTGGCCACATAAATGTCTTCTT 

CTGAGATGTGTCTGTTCATAT C TTTTGCCCACTTTTTGATGGGTTTTTTT 

TTCTTGCAAATTTGTTTAAATTCCTTGTAGATTCTGGATATTAGCCCT7T 

GTCAGATGGATAGATTGAAAAAATTTTCTCCTATTCTGTAGGTTGCCTGT 

TCACTCTjACAATAGTTTCTTTTGCTGTGCAGAAGCTTTTCAGTTTAATT 

AGATCCCATTTGTCAATTGGCTTTTGTTGCAATTGCTTTTGGTGTTCTAA 

TCATGAAGTCTTTGCTGVTGCCTATGTCCTGAATGGTAT7GCCTAGGTTT 

TCTTCTATGGTTTCTATGGTTTTAGGTCTTATGTTTAAATCCTTCTTTTT 

TTTTTTTTTTTTTTTTTGAGATGGAGTCT7AGTCTGTTGCCCAGGCTGGA 

GAGCGAGTGGCGTGTCTNTAGGACGC 

>Concigl9 

GCATGTTGTCTAAAGGTTTGTCTTCCTCCAAAATTCATATGTTAAAACCT 

AGCCCCAAATGTGATAATATTTGGAGGAAGGCTCTTTGGGAGGCAGAGCC 

CTCATGAATGGGATTAGTAGCCTTATAAAAGAGACCCCTGAGGGCTCCCT 

TGTCCCCTCCACCGTGTAAGGATGCAACAAGAAAGTATGGTCTATGATCC 

AAAAAGCAGACCCTTGCCAGGTACCCAATATGCTGGCACTTGAACTTCCC 

AGCCTCCAGAACTGTGAGAAATAAATTTCTATTTTTCATAAGCCACCGAG 

TCTATGG7ATTTTGTTATAGGAGCACAAACAGACTGATGTGCCACCCAAC 

CATGATTATACGTGTAATTTATGGTTTCTCTGCTAGTAGGGATGCACCAT 

GGGGTTAGGAACCACGCTTTTCTTATTTCCCACACAGTCCTTAGCTCTAA 

GCATGTTCCTGAATCAAAGATCCCCATCTTTTATGAATGAAGGAGTCAGT 

GAATGAATTAATGAAAGAACTGATAACCCTCAATAATTATTCCAGCCTTT 

TATACCTACTATTAA 

>Contig20 

ACGGTTCTCTAAAGACTTTCAAGAGCTGGATTTTATGCTTTAGGTGAAGG 
TGATAAAGTAAAGTGCTTTCACTGTGGAGGGGGGCTAACTGATTGGAAGC 
CCAGCGAAGACCCTTGGGAACAACATGATAAATGGCATCCAGGGTGTAAA 
TATCTGTTAGAACAGAAGACACGAAAATATATAAACAATATTCATTTATC 
CCATTCACTTGAGGAGTGTCTGGTAAGAACTGCTGAAAAAACGCCATCAC 
TAACTAGAAAAATTGATACCATCTTCCATAATCCTATGGTACAAGAAGCT 
ATATGAATGGGGTTCAGTTTCAAAGACATTAAGAAAATAATGGAGGAAAA 
AATTCAGACATCTGGGAGCAACTGTAAATCACTTGAGGTTCTGATTGCAG 
ATCCAGTGAAGGCTCAGAAAGACAGTACACAAGACGAATCAAGTCAGACT 
TCATTGCAGAAAGAGATTAGTACTGAAGAGCAGCTAAGACACCTGCAAGA 
GGAGAAGCTTTGCAAAATCTGTATGGATAGAAATATTGCTGTCGTTTTTA 
TTCCTTGTGGACATCCAGTCACTCGTAAACAATGTGCTGAAGTGGTTGAC 
AAATGTCTCAAGTGGTACGCAGTCATTACTTTCAAGCAAAAAAATTTTAT 
GTCTTAATCTAACGCTATAGTAGGCATATTATGTTCGTATTATCCTGATT 
GAATGTGTGATGTGAACTGACTTTAAGTAATCAGGATTGAATTCCATTAG 
CATTTGGTACCAAGTAGGAAAAAAAAATGTAAAGCCAGTGCTTAGACACA 

>Conxia21 

CGCTGTCTTAAGAACTGGGCTAGGAGTGAGCAGTGAGCCAAGATCGCACC 
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AXTGCACTTCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAAAAATA 

CATATATATATATGACCCATAAAAAGGAGATAAATCAACACTTCAGAACT 

GACCCAAAC7TGCAAAGATACTATAATTAACAGAAAAGGACAGTTTACTA 

AGTACTCCGTATGTTCAACAAGTGAAAGATTAAACATATTAAGTAGAGAT 

GTAGAAGATATAAGAAGATCCAAAATGAACTTTTAGAGTTGAAAACTACA 

ATATTTAAGATAAAAATACACTAGGTGGGATTAAAAGTAGATTACACZATT 

GC^TAAGATAAAAAAAAATGAGCCTGAATACAGCACAGTATAAACTATCT 

TAAACAAAAACACAGAGAGAAAAAATAACTTTAGAGACTTAGCTCTTATC 

CTCTATTTGTTTCTAAACAGAGGATAAGGGGCAGAAAAAATGTTTGAAGA 

AATCATGATTTTTAAATTTCCAACTGAGATAGGAATAGCACTGGGTAGTC 

ACAGGAGGCTGGAAAGACCCAAACAGCAGTTAAAACAGGAACTAGGCAAA 

GAAACCAAAGGATAACAGTAAACCTAAACTAAGGGAGAGAAAACTGACAA 

AAGCTGACTTAGGATAACTGAC 

>Contig22 

CCTGAATATAAGCCGCAAGTAACCAATTAAATTTGTTTTCCAAAATTGTA 

TTAACAATCTATGAAATTTTTATCTTGACCATAGCTATAACTTCCAGAAG 

CCTTTTATAJVCCTCTATAACCTTTATTAAGGAGTAGGTTAATGCTTCAAG 

AAAACCTTGTTAATCTGACACAGGACCCATATGCTGATCTTGCATCAGTG 

TGGCTTGGACATCAATGATTATGATTAATTTATAGAGAAATTGAACTTAT 

TTTATCTCTCAAAATTGGCCCTTACAATCTCACACACCCACCTCTTCCAC 

TATAGTTCCTGGGCCTTGAGTTGAATAGCTTTAATTTCTGGCTrtSTGTT 

TCAAGAATGCAGTTTATTTTGATTGGCATTTTCTACCAGTCCTGAAGATG 

AACCTTTAATTGCTGTCAGTATTTAAGATTTAGCAGGACTTGTCCTTTTA 

AGAACCAGGAGTCAAGCCCTATAACTCAATGTCACAAGGACTTTAAAAGC 

ACATACATAAAGATATATGGATGTAATAATCATAATTTTTAAAAAATTGT 

ATTAATCTCAGTGTTTTCTAAGCAAACCAAAACTTAATAATAATGGCATA 

GAAATTATTTCAATAAAACATAAAATCTGTTAAGCCAGTTACCAAAAGGC 

AAAAGAAAAGACCTTCTGCAATGCACAGAATATTATGTTGGAAGAAAACA 

TTTCCTTTAGACCTTTAAGAAAACATTGTTAGCATCAGGACACZAACAAAC 

AGAATCTGAGGGTAAAAAACGTATATGAGCTGAAGGGAGTTGAAGGAGGG 

CATTACTATTTCCCACCCTTTTAAAGGGGAGAGAAAACCTAAAACAGCAA 

GATGCAATAAAAGCrGAACTTTGGGTTAAAAAAAAATTCTTAAGTCTCTT 

ATAATTTATTAAGAGTGAATCAACCCCGTAAGAAAATTTCATTGTTCTAA 

CCAATTTTTTAATATATAAGTAGTTTTTTAACATCAACCCAATCTCTAGA 

AAGACCATTATAATTTCCCTTTAATTATAGACAACTTTATCATATAAAAG 

TTTTTTTAAATAAATCCTCTTATTGTGACTTACACAGACTATTCATGACA 

TGCTTGGACTTTCTGGTTTGTCGTGAACATCCTTTTCTTTCTTTCTTTCT 

TTTTTAAATTTTACTTTACGTTCTGGGATACATGTGAAGAACATGGAGGT 

"TATTACGTAGGTGTACATGTGCCATGGTGGTTTGCTGCACCCATTAACC 

CGTCATCTATATTAGGTATTTTTCCTAATGTTATCCCTCCCCTTGCCCCC 

CACCTCCTGACAGGCCCTGGTGTGGGACATCCCCTCCCTGTGTCC ATGT G 

TTCTCAATGTTCACTCCCACTTATGATTGAGAACTGCAGTGTTTGGTTTT 

CTGTTC 
>Contig23 

GCTAAATATAAGCTATGATAAAACAGTTGGCCCTCTGTATCATGGGTTTC 

ACAACTGTGGATTCAACTAACTGTGGATGAAAAATACTTGGGAAAAAAAG 

AATGGCTGCATCTGTACTGCACAAGTGCGTGCTTTTATTCTCGTCATTAT 

TCCCTAAGCAATACAATATAACAACTATTTATATAGCATTTACGCTGTAT 

TAGGTATTATAAGTAATCTAGAGATGATTTGAAGTATACAGGAGGATGTG 

CTrAGGTTACATGCAAATATTATGCCACTTTATATAAGGCCCTTGAGCCT 

CCTCAGATTTTGGTATCCATGGCAGTCCTGGAGTCAATTCTCCTGCAACA 

TCTCCATTTGTTCAGATTCTCTTCTATATCATGTTTATA TCAGAA AATCT 

ACATAAGATTTTTTAATGTGTTCATATAGGTTTTGTGTATTTTTGGTTGT 

TAATCCCTAGATATATGCAGTATTTATTGCTATTATGAGTAGTGTTTCTT 

TACCATGTATTCTAGTTGGTTATTGCTGACAGAGAAATGTTGCTGGTGTT 

TCTAAGTTACCTTGTTTCTAACAACCTTGCTGAACTCTTATTAGTTCTCA 

TAGTTTTTAATTAATCTTTCTTAGTTCTGATAACATAATCTGCAAATAAT 

GACAATTTTATATCTTTCTTTCCAATGCTTATATCTCTCAGTCCTCTTTA 

TCCCAAAGTATTTTCCAGGATCTCCACTATAACATTAAATAGTAATAAGA 

ATTTCTGTCTTGTTACTGATCTTAAGGAGAATAAATTTAAATTTCCTCTG 

FIG. 4 (6 of 61) 


WO 99/06426 


PCT/US98/16102 


^CAGGTTTTATGCTTGATATAGATTTGTQATATATAGCCTTTCACAGkJt'i , 

AAAAAAAAATGCrrTCCTAGTAGTCCTAATTTTTTAAAAAAATCATCATA 

AATAGATGTTGAACATTATCAAATGCTTTTTCTGCATCTATAG AGATA AT 

CATATGGTTTTTTACTATTTATTAATGTAATGAATTAGACCAATTTTCTA 

ATGCCAACTCTTTCTTGTATTTGTAGGGTAAATCCTATGGGATCATAAAA 

TACTTTTAATACATTGTTAGATTTGAAGAGTTAACGCCTTATTTAGAACG 

TTTTCAG7CACATCCATAAGTGAAATGGCACTATAGTGTCTATTACTATT 

ATATTTTTCTGGTTCTGAAACCAAAATTATACTCACCTCATACAGTAAGT 

TGGGCAACTTTTGTTCTTTTTTTCTGAAACAATTTGTGTATAGAAGAAAT 

TAACTGTTCCTTGAAAGTTTGATAATAATCATCCAGAAAATTATCCCCAT 

CTAGGGCTTTTACAAAAAGGAGACTCTAGAATGCCATTTCGGTTTCCTTG 

ATGTGTATTGGCCTCTTTCATTTAGGCTTTTGGATTTTTTAGGGCATTTT 

TTCACTATAGGCTTTTTACCGG 

>Contig24 

CATAAACTTCAGGTTGGATGTTCGGTCAAAGTGGTCCGGCGATGCGAAAA 

CGAGAGGGCTCGAGGACTGGGCAGAGAACTATTTGAAGGTAXCTCTCAGG 

GGAAACCAAGCGGAAGGCGGGGAGTAAAATTGGGAGGGAGCGACGGCCTT 

CAAAGAAGGGGCTTGCATTAGATCGGCGAGATCCGGGAGGGTCTGGTGGG 

GAGAAATGACTAGAGGACAAATCTAATGGAGAGACAGACGGAGATAGATA 

TCGTGACAGAGAGAGGGACAGTGACAGCGCACAACAGTGCAGGGTCCATG 

AGTACAAGGCCCTTAAGTGTACACCCCAGCCGGAGTCATGGCAATTCGAT 

TCCTGTACTGACCACCCAGGATTTGGGTAGACTGTACGAGTTAATGAGCA 

TGGTCCCCAACAAGACTGCTTCGACCTCAGATGCAAAGCACACTTCAGGG 

GTCCCCAAGCCACTCATGTTTTTTGAATGACTGCCATAAGTTCAAAAATT 

CCCACAATTCTCTCAGATTCAATAACTGGGTATAACCACTCATAGAACTC 

AAGAAAATGCTATCATTATTATTACAATTTTATTATAAAGGATACAAATC 

AGAAGGACTAGCCAAATGAGGAGACACATAGAGAGAGGACTAGTAAAAAA 

CAGAGCTTCTGCGTCCTACCTTCAAGGAATCAGGATGCACCACCCTCCCA 

GCACATCAAGTGCTCATCAACCAGGAAGTTCCTCTGAGCTCCAATGTCCA 

GAGATTTTAGGGAGGATTCATTACATAGGTATCATTGATTAAATCATTGG 

CCATGTACTTGAACTCAATC7CCAGTGTCCCTCTTCTCCCTAGAGG TCTG 

AAGGGTTItXSCTAATATCATGTGGCTCAAAGCCCCAACTCTAATTACCTTT 

TTGGTCTTTTCAGGGACTAGACCCCATCCTGAAGCTATCTACAGGCCCTG 

CCATGAGTTAGCTCATTAACATAACAAAGACACTTATATTACTCAGAAAA 

TTCCAACAGTTTTAGAAGCTCCATGTCAGGAACCTGGGACATAGATCAAA 

TT C ' mTL"l I 1 IT I TT m TTTGGAGACAGGGTCTTGCTGTGTTGCCCAG 

GCTAGAGTGCAACGACAGATCACAGCTCAATGCAGCTTCAACTTCCCAGG 

CTTAAGTGACCTTTCCACCTTAACCTTCC AAGTATC TGGGACCACAGAAA 

ATGGCTAATTATCCTGGCrGATTTTTAAACTrTTTTTTTTTGTAGGGATG 

GGATCGCCCTGTGTTGCCAAGGTTGGTCTCAAACTCCTGGGTTCAAGCAA 

TGATTCTGCCCTGGCCTCTGTGA TGGTTA ATACTGAGTGTCAACTTGATT 

GGATTGAAGGATACAAAATAATATTrTTGGGTGTGTCTGTGAAGGTTTCG 

CCAAAAGACATTACTTTGAGTCAGTGGACGGGGAAATCCCCCCTTCCCCA 

TGGGACGGGGAGACCCCCCTCCATCCAGGTAAAAAAATCTAATCACCTGC 

AATGTGGCAGAAATAAAGGAGGGAAAAAACGGGGACCCCTANATGGGTTA 

TTCTCCACCTAATTCTTCCCCCAGG 

>Contig25 

CCATGTATTTCATTTCTACAGACCCTGAGATGAATTTGTCATTGCCACGG 
GGTCCTGAAGTTCAAATACTCTATTTGGTATCCTGCCCCTGTGGTTAACT 
GTGATCATTTCACTCACCTTGTTTATGATGAGAGGTGCCACCATCTGGCC 
TCCTCCACTCTGCAATCCTGTTAATTCCTATCAAAGCTGAAAACCTGCTG 
CAGCACCCACACCATCACCTCCAGCCTAGAGAGGGAAGCTACCAGTGAGC 
TCTCCTGGATGCCGGTGTGCCCCTCGCCAATACATTTCTTCTTAGTCCCT 
TGGTCATCCTGAGGTGTGTGATTAATGGACAGCTATGTGGATTGCACATA 
ATAGATGTACTCCAGCATCTTCATCCCTGATTTTCCTT TACAG AAATCAC 
TCAACCTTAGCAACATGTGAAAATCACCTAAGGACATTCTTTAAATCCCT 
CTGTCCACATGGCAACACAAACCACTTAAATAAGAATCTCCAGGGAGTCA 
CTCAAGCATCAATGTTTTTTAAAGCTCCAATTTTAAGGATCATTACATTA 
TGT CGAAGAAATTATAGTATTTCAGCCTTACTGACTGTAAACCACCACCA 
TATCTAAGCATCCATTAGTCAACCTAGCAGACAATAAACTAACATTACCT 
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ZCAGGTACTCAAATCAATTCATTGCATCCCAAATCCCAGATGGGCCCACC " " 

C77A77GACAAA77CAGCCCAA7C77GG77GAACACA777AGAA7A7A77 

TCCA7GAACAA7A7CCGG77GACGAG777C777AAC77T77GGAG777AA' 

GCCA777CC777CACAG7AGCC77G77AA77CCC7G7CAA7GC7CCA7GG 

GGGTCATGAAGAGACCTCTTATTAACTGTGAAGCAACTTGGCTCAGGTGC 

AGACACT CAAATGCTTCACATGCAGTGGGAAAAGAGAGTGATTGTCTAC 

>Contig26 

777AAAAAGAAC7GAG7C777A77CAG7CGA77C77C7AA7C7A7GAACA 

7AGCA7C7C7C7CAAAGCA777AG7CC77C777AA777C7G7CA7TAAT7 

TTTTAAAATTTTCATC CTAAAGATTCTGTATATGTTTTGTTGAATTT ATG 

CTTAAGCATTTCACTTTCTTGGTAACAATTATAAATGATTTTGTGTTTTT 

TATTCCACTAGTTCATTTTCAGTGTGTAGAAAAGCAATGAATTTTTGTGT 

GTTGATCTTTGTTCCAACATCTTGCAACATTATTGAACTCATTTATTAGT 

TCTAGGAGGTTTrTTCATTTTTCTTGTAGATACCTTGAGATTTTCTATAT 

AGACAGTCATG TTGTCTGCAAACAGG CACAGTTTTATTTCTTC C TT IT CA 

ATCTATATGCCTTTTTTTTTTTTTTTGCCTTATTGCAGTGGCTAGAACTT 

CTAGCACTATGTCAAATAGCATTGGTGAAAGCAGACATCCTTGTTCCTTG 

TCTTAGAGGAACATTTGGTCTTTAATCTTGATTTAAAAAATTCCTTGCAC 

TAAGTTACCGTGTTTTGCGGGAGGGAGAGGTGGGGTGAGG7GGGGATTTC 

CCCTAATGTTTACAAGCTGGGATTTTCTTTTTCCTGTGTCTAATTATTTT 

C CTC ATTGGCTTGAAAAATCTGATAAAACATTTTAGGACTGTGTATAAAA 

7AGAA77AGCCAAG7GCAA7G7C777A77CAGAAGAAA77TCA7GGACG7 

TGTGCCTACTCTCTTGGCTTCCTGGCTTCATGGCTTTCCAGATCCCACAG 

7AAGC7C7GGA7AG7AGAAG77A7AG7AAGAC7GAC77C7AAA7AAA7GA 

AG7GAC777AACC77AC7GA7A7GGC77AAAGAAAAGGAG7GGCC77TAA 

GATCCATGAACTTCTCAAACAAAAGTGATAACGTTATCTCCATGCATATA 

TAATACTAAATATAATGCAACTGAGAGAAGTAGGCTGTGGTAAGAAAGGA 

GACCCAAG7GCCATCTGAAGGCAGCACTTACCACTCTGCTTCATCCCACC 

GAGGAAACAAAGCATGAGTATTGCCAGATTTTCTTCTGTTTCAAGAAAAG 

CCAGAAATCCAG G TTTTTGCGTGAAATGTCCTGATTTTAATGTTGGGAAC 

TAATTTATATTTTGAAATAACATTGTGTGGGACAAGTGAACTTGTATGTG 

GAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCGTTGATACTCAGCAA 

GTTCAGCCAAGTGCGCCTTGTCATTGTCAGTCATCAAGGTGATGTGTGAT 

7GG7CAAGCAA7TAATTTTGCTCAGCA7C7CG7G7G7T7TCAAAAGAAC7 

GAAGGTTCATTTGC 

>Contia27 

TTTCAGAGCACAATGCGTATTCATAGTATATTGACTTAATTTCTAAGTGT 

AAGTGAATTAATCATCTGAATTTTTTATTTTCAGATAGGCTTAACAAATA 

GAACATTCTGTATATAAATGTGTAAATTAGAGTTAATCTTTCCAATCACA 

7AATTCGTTTTATGTGAAAAAGGAATGAACTGTTCCATGCTGGTGGAAAG 

A7AGAGA77A77777AGAGG777G7CG77G7G77T7GGGA77C7G7777C 

7777AAAA77GTAAATATGTACT7GTGTGAA7GAT77TTTAAAA7GATT7 

7ACCA77777GGAAGGGTATTTAATGA7AGAA7ATCA7CGAGCCAACATG 

CACTGACA7AGAAAGA7GTCAAAGA7A7A7TAAGTGTAAAA7GCAAGAGG 

GAAAACAC7A7G7ACAG7CTGAGCCAAA7CAAAGCA7G7A7G777777A7 

A7G7G7ACAACAAAAGGTTTGGAAAGATATGCGCCGAATTGTTAAATGTG 

G777CAC77GAGGGGGTGGGAGGA7GGGGCCCCAGAGGGGTT7TTATGGG 

GGCC777CAC77GG7A77T7T77CA7777G77C7G777GAAA7777G777 

7T7C7T777AAA7GGAGTTTCACTCT7GTCGCCTAGGCTGCAA7G7AGTG 

GCG7GAAC7CAGC7CAC7GCAACC7CCGCC7CCCAGG77CAAG7GA77C7 

CC7GCC7CAGCC7CCCA7GCCTCC7G7G7AGC7GGGATTACAGGCACCCA 

7CACCA7GCC7GGCTAA7TTTTGTA77TTCAG7AGAGATGGGG7TTCACC 

A7G77GGCCAGGCTGGTCTGTAAT7CC7GACC7CAAG7GATCCACCCACC 

77GGCC7CCCAAAGTGC7GGGA7T7CAGGTG7GAGCCACCACGCCCAGCC 

C7G777AAA7777TTA7AAG7A7G7AC7AC7TTTG7AATCAGAA7TA77A 

GAAAGCA7777AC7GA7TTAAAAGCT7AGACA7GTTCAAATGCC7GCAAA 

AC7AC77AACAC7CAGCT7TAG7T777C7AA7CCAAAAAGG CCGGG CAGT 

7AA7C77777GG7GCCAA7G7GAAA777AAACGG7777A7G77777CC7G 

7G77G7GAA7GAAAAA7A7T7C7GAG7GGTGG7777T7GACAGG7AGACC 

A7G7C77G7C77G777CAAAA7AAG7A77TC7GA77TTG7AAAA7GAAA7 
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ATACAATATGTCACAGAfCTTCCAATTAAGTAGTAAGGGTTTATCCTTAA 

7C CTTG CTAATTTAAGCTTGCATAAGTCACTTTACTAAAAGATCTTTGTT 

AAGCTAGTATTTTAAACATCTGTCAGCTTATGTAGGTAAAAGTAGAAGCA 

TGTTTGTACACTGTTGTAGTTATAGTGACAGCTTTCCATGTTGAGGTTCT 

CATATCACCTTGTATCTTGAAGTTTCATGTGAGTTTTTACCATTAGGATG 

ATTAAGATGTATATAGGACAAAATATTAAGTCTTTCCTTTACCTAAGTTT 

GCTTTCTTGACTAGTAATAGTAGTAGATATTTCTGTAATAAATGTTCTCT 

CAAGATCCTTAAAATCTCTTGGAAATTATAAAATTATTGGAAAGACAAGA 

ACAGTTTTTATTCATTATATGCATTATTATCG 

>Contig28 

CTTTCTCAAGAAAAGGGAACTGGAGCAATTAAACATATGTAATTTTTTTT 

TAAAAAACCCTAAACCTAAACATCTACCTATATACAAAAATTAATTAACA 

ATGGATCATGGACTCCAATGTAAAACATGAAACTCTAAACTTCTAGAAAA 

AAAACTGGAGAAAACCTTTGGTACCTATGACAAGGCACA G TTTTTAGACT 

TAACACTAGAAGTGTGAACTATACAAGAAAAAATTAATAATTTGAACCTT 

ATGAAAAT CAAATT ATTTGCTCTCCAAAAGACCCTGTTAAGftGGATGAAA 

ACTAAATTACAGATTGAGAGAAAATATTTGTAAATCACATATTTGACAAT 

GGACTTGTATCTAAAATATCTAAAGAACTCTCAAAACTCAACATTAAAAA 

AAATATCTAATTAGAAAATGAGTGAACATTTTACGAAAGGGGCCTTATAG 

ATTAGCAAATAAAACAC7TGAAAAGATACTCAGCATCACTAGCCATTAGA 

AAAATGCATATTAAAACCACAATAATGTATCGCTACACACATATA&GAAT 

GGTTTATGAAAAAATAGTGATGACACCAACTGTTAGTGAAGATG7GGAGA 

AACACTCATACATTGCTGGTAGAAATGTAAAATGGCATAGCCACTGTGGA 

AAATTATTTGGCAGTTCCTTTTAAAACTAAAAATCAATCTACCACACAAC 

CCAGCAATTTCATTACAGGGCATATATCCCAGAGAAATGAAGATTTATGA 

TCACACAAAAATCTGTACACAAATGTTTTATGGTCACTTTATTCATAATA 

GCCAAAACCTGGAAACTATCCAAATGTCCTTCAATGGGCAAAGGATTAAA 

CACACTGTGATACATCCATACCATGGAATACTACTCAGCAATAATAAGGA 

AAGAATTACTGCTACACACAAGTTGGATTAAACTCAAGGAAATTGTGCTG 

AGTGAAAAATTAACAAGCCAATCTCAAAGGACACATAGTTGATGATTCCA 

TTTGTATAACATTAATTAACACAATTAATTACAGAGATGGAGAACAGAAT 

AGTGGTTGCCAGGGATTATACATGGTGGACGCGGTGAGGCGGGCCTCCAC 

GC CTTGGAGATGAAGGGGGCTACACC CTTTAAAGCACAC C CACGAGAGAG 

TTTTGTGCGGAGGGGCCCAATTTAAGTACTCCGCCCCGGGGGGGGAACAC 

AGGGGCAAACAAAAAAAATTGGCCTTGGGGGTGACCAAACACACAAAAAA 

AAAACAAACACACAAAAAAACAACNATGGGTGGGAGGATTAATCGCCAAA 

TCTGAGTAAGCTATCTGGACAGTACCAATATCGATTTCCCAGTTTTGATG 

TTGTACTATAATAATGCAAGATGTTAACATTGGAAGAAGCTGGCTGAAGG 

GGGCTCAGGAACTCTCTGGACATTTCTTTGTACCTTCCTGTGAATCCATC 

ATTATTACAAAATAGGACATTTTCTAAAGGTTAAATCATTTTAATTTTAA 

AATGTCCCTGTTACTGTTGAAACTCACATCTCCATATACTGATCA AGAAC 

AGCACTAATGGCCCCTGGCCTCCAGGAATTCACAATTCCTACTGACTTTT 

CTTTGAAACCTTGGCG\AGTCGCTTCTCTTCTCTGGTCCTCAATTTTTCA 

TCTTCAAAATGAAGATTGAATGACTATTAAAATCTCTTGCAATTCTTGAG 

ATGAAGGGTCCTAAAGGAACTGAAGAGGATGCCATGTAATGTAAATATGG 

GTT77TACTCCATCAGCCAGCCAAGACAGAGGGCAGACACCAAGACATGG 

TAACCAAGGAGGCCATGTGTAAACAAAGACCATTTAGACTTATGCTCTGG 

CCTTTGCAGCCCAACTGGTGTGGCCAGTTGGTGGGGTATGAAGAAAATGG 

GGCCTTCCAGGAACCATGTTGAGTGGAGATAAGCAGGGAGGAATGCAGAA 

GACATGGGGGCAGTGCCAGTCTCAGCCCGAGCCAGCTACACCCACACATG 

GTTATGAAAGACTGACAGCCTGTAAGOTGAACACAGCCCTGCCTCTCTTA 

GATAGGC 

>Contig29 

GCAAATATGATCTCAGATGTGGATTTACTGTAAAGTTCATCAAATTTAAA 
TTTCAGAACACTTAATCTGCAAGAGTCCTTTCCAAGACCCTATACCTAAT 
TTTGTGTTTACAATTTTATATTTGTTTrCTTAAAGAAGACCACCAATATA 
AACTATATCCAGCCTTCATGATAAGTACATAGGAAACTATGCAAATAAGG 
GGGAAAAAAAACAAAGAAAAATACCTAGTTTACTAATGGTTCACTTCTGA 
ATAGCACATATTCATAATGATACAAGCACTCATTACTAGTCTAGGAAAAT 
GAAGATATAATTGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATGTGT 
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G7GGTATAGACTAGGGCAGGACAAAGAACC7AAATCCTCATTTTCTAAAG 

ATAATTGTTAATACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCG 

GTCATTAAGAAACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGG 

AAGAGGGC GACAATCTGATTATTTTTTGCAACAAATTTTGTAAAAC CATT 

TGACTGTTTACATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAAT 

AATACTATTATTTTTTAACTGG A TTTTTGAAAAAGAAGATAAAAGTCTCA 

TTTTAGTAATTAAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTC 

GAAAATTAAAACTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGG 

AGTTCGAGACCAGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAA 

TACAAAATTAGCTGGGCGTTGTGCATGCCTGTAATCC CAGCTACTCGGGA 

GGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAG 

CCGAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCA 

TCTCAAAAAAAAAAAAAAA AAAAAA TTAAAACCTCTGGAAGTTGAGTTTG 

CAAATATTCATTATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATG 

ATGAAAATTGAGGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCAC 

AGCCCATTCAATTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGG 

TCCATCCTTCACTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCA 

CCTCTAATTTGGAGCCCAAAAAAAGGCACTGGGAAATGGAAAAGTCTTTT 

GTACGTGGTACTTGCCGGGGAAGCTGCCATGAAAACCTGGCCCCACGGTG 

GGGAGGGAATGCCCANCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGT 

CCAAACATGTCAGGTGGTCTGACAGGGCAAGCANCANGAAATCATC5TATG 

AGTATGAACTGATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGG 

GGCGTGAGAAAACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAG 

CCATGGGAGGTCACAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGA 

GATGTGAGTGTAAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGC 

CCTCTTTCCTACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACA 

TGGGTCCTCAGAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCC 

TGGTCAGCAGCAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGC 

AATGAGCTCTCAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCA 

AGACCGATTTTCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGG 

CTGAGACTAGCTGGCAGCTCCAAGTTGTTGGTCAGAAGAGAACAGGAACT 

AGGGAAATTGGAATTACTGTTACTACAATTCCTTTACATCCGCACAACCA 

TGAGGTCCAGCGATTTTCTATTATTTTTTTTTTTAAGACAGGGTCTCAGT 

ATGTCGCCCAGCATAGAGTGCATTGATGTGATCATGGTTCAGTACAGTAT 

TCACGTCCCAGGCTCAAGTGACCCTCCTGCCTCAGCCTCTCAAGTGGCTG 

GGACAGCAGTTGCATGCTACCAGGCCAGGCTTTTTTTTTTTTTTTTTTTA 

GTTTCTGTAGAGCACATAGC 

>Contia30 

GGTTAACAATGGCACAGGGAAACAAACAGTTCCAGGTGCAGGGGCTCTAA 

ATCTATCATAAGATGTTAGGTATGGGGGCTCTGCCGGACACAAACTCAAG 

GCTTTATGCTGTTATCTCTTGAGCGAAATCCTGGGAACTTCGTACATTGC 

TTGCTTCAGTACCTTATCAGTTAATCGGACTCTTTGATATGTTGGGAGTC 

AGCGTACACAAGTTAACTCCTTGAGGAAGGGGGTGGGTAAGGAGTCCTTG 

ATGTCTGGTAAATGAAGGAGCGAAATCGAGTTCCTCTGGCTTTCTCAGCT 

AAGGGAGAGCTTATTCATGTGGAAACAAGGCTAAGTGATTAAGGGAGAAA 

GGGAGAGTCTGAAAACAAGGTTAGGTATTACAATGTCAATAAAATTGGTC 

TCCTTATACAGTCCTATGGTAGATTTCTTTCCATCTTTAATCTCCCTCTA 

GCAC CAC CAGACTTTTTCTCTCTGTACCTTGAGATGTAAATTTTGCTATC 

TGAATTTTCGTCTAAGAGTTGTTTCCTTTAATATGCAAATTTAGGGTTAT 

TTAGCTGACAACTGCCAAAGTAGTGAAACAAGTTATCAAGAACTTGAACG 

TCTAAGGTAGGAAAAAAAAAAGTCTTTATGAATCTATAAGATGTACTTCT 

ATTGGCATGCCTAATACGTCTATGTATTTACGTGTTGTGTACACAGTTTT 

TCACTACTGAAAATATATAGAGGAGTTCTAATTAATTGACTTAAGACAAT 

AAAAGCGCTTGAATCAAATACCTTATCAGGAAAAAGGAAAAGACAAGTCA 

AATGCTTGTTCAAGTTTATATAACTTAAGTAAAATCTTTAATAAATAAGC 

TAGCTTTAACATTATTTGAAATGTCTTAAGAATTGCCAGCAGGTTCTGGG 

TTACAGAACTAGTGGGGGTGCAGTGGGGTGAGGGTTGGTGGGGTGGGGGG 

TGGTACGGGGGCTTTGTTTTTTCTTGCTGCCCCCTTCTGGGTTGGGGAAG 

TGGCAGGACCTTGGCAGCACCCCGAGCCGGCATGGCGTTAATAATGGAGG 

GATGCCAGACCCAAGTGGCTAAGGCCCGGCTGCAGAGCCAAGTTGGCATT 
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TCCAGACTGGGGCTCGGGCCGCACCCTCTCCAGGACCCTCCCCTTGtACC 
GAGCAGATTGTCGCGGGCAGTTTGGGCCAGCTGTCCTGGCGTGGAATTTC 
CCAAATTCAACAAATCCTCCAAGAAATCAATCCATCCATTCATCCATCCA 
7 C CATC CAT C CAT CCATGCAT CCAT C CATC CGTGGCAGATTATGAAGCAT 
GGATCATTACTTTTGGGATGTGGATATATTCAGTTAACAAGGAGCAGCTT 
TCAAGAGCTGGATTTTATGCTTTGGGTGAAGTTTAGAAACACTAGCT CC C 
AC* 

>Concig3I 

ACCTCATGTGCTCTAGCGCCTCTTACCTCATGCCCTCCACTCTCAGTCTT 

GCACTCACCCTGCCACACTCAAGGGCTTCCCCAGGTTC C ' lTCrTA GATTC 

CACCGATAGCTCAGGGACTTTGCACATGCTACGGTCTCTGCCTGGCTCCT 

CCCCAGATCTTCTCATGCCTAGCTGCTTCTCATCAGCACCCCTCAGAGAC 

TGTCCCTGCCCCACCTCTCCAGGTTCCATACCTGCCACCCTCCCCCAATC 

ACGTAACAGTTTCTTCACAGAGCGAGTTACCATCCCAGTATTTCCCTAAC 

TTATTTTTTGTGACTGGTCTGTTGCCTGTCTCCACCACAAGAACATAAGC 

TGCPlTGTGAACAGGAGCCTTGTCTATCTTGTCACCCCAGTGQCTGTGACA 

TAACCTGATACACATTAGATGCTCAATGATGTTTGATGAATGAAGTGCTG 

GTAGTCCAACTGTGTTTCCITGTCTGTGTAAGTATGTCTGTTGTGGTTTC 

CTAAGAACCTACAGCTCTCCCACTGTGACTCCTGTTCTATGGTCCTGATT 

TGCTGGACTAGAATCCTAACCTACATGCTTACTCTTAGTGTCCTCCCCCA 

GAGGCTGAATCCCAGTCCCTAAACCTCCACCAAATGGCTAAGACCtAGCT 

TCCAACCAGACAGGCCTACGCTGAGACCTCAGCACCGCCCTTCTGCGGTC 

TCATCCTTAACGCATCCTTCAGGGCCCAGCTTAAATGTCTCTTCTCCAAG 

GAAGGCTATCCTCTTTCTGCCCCTCAGTGCTCTCCATGCCTCCTCTATGC 

CTCCATGCCTGCTTTCCAACCCTGCAGAGGTGGAGAAGTTGCTAATCTGC 

TGTGTTGACATGTGCTGGGGTGCCTTGGGCCAGGGAGCAGGCTGGTGGTG 

TGCTGATAGCCCGTGGCTGTGCCCAGGTCCATGCTCACTTCCTGAGCCCC 

AGTGGAGTAGGCTCCCTTTCCCTTATTGCAGCACTCAGAGGAAGGACGTG 

CTTCTTAGGACAGATCTGGCCAACCTCTCCCTCGTGAGAGAAGGCCCAGC 

CATCCTCTTGCCCTCTTTCTTTCTCCTGCCCCCGAGTAATAAAGGTGCCT 

GGTCAGAGCCTTCTAGAAGGAGACCCAAACATCCACCACACATTCCCAGT 

TCCAACCGTCATCCACATGGCTGGCTGTGCAGGTAAACGCAGAGTCTGTT 

TCACACACCCAACCATCTAGTATTGGATGGGAGGACAGTAGCGTGACACT 

CTTCTCCAGCCTTGAGCCCTACTGTGGGCCCCACCCAACCCAGATACCAG 

AGGAGCCCTGTACTGGGATGCTATTGGATGCTTGTCCAGTCATGTACAAA 

GTTAGCCCTTTGTTATATAGAGTTAGCTACGTACATCTTCCTCTGTAGGG 

AACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTAACCTGCAAATCCT 

CTGCTGAGCACCGTGCACTACATACAGTGGGTAGCATGTGGTAGGTGCTC 

AATAACTATTGACCGATCTATTGAATACACGTAAGATCGTGACACTATCT 

AAAACGNGGGGTGTGGGGGAAAAACCCCCCCCTTGTTTAGGAAACCCAAA 

TTGGAC CGTGTTGGC 

>Concig32 

GCGCGATTGTGCTAAAGATCATGCATGCCTGATCAAACGTCCCCATATGG 
CGTCTCAGAGTCAACTCCTTCCCCATCAGTGCCCTGACTTCGGCATAACA 
AACCTGGCAGGTTAAGTGATTAATCGGTCCTGTACAACTGTAGCCCTTAG 
CAGGAAGCACTAAGCTTCGTTTTCATTTATTTCTTCCCTGGAACTGCAAG 
AAATGAGGGATGCCTTCCGCCATGAAGTTTTGCTGATTGTCCACTTT G TT 
CTCAAGGAGATATTCACAGTTTTTAATTTGTCTTTCTCTCCTGCATGGTC 
TCCAAACCTGTCCAAAGAAGCCAGCTGGCTCCATCATCTGTAAAATCACC 
ATTGTCACCAGAGCACTTGACTTCCTGTTGCCCTACAATCCACCTGCACT 
TTATTTC CTGC CACCATGATAATGTAGTGTTACTACATTTTACATTCAGC 
TGTAAGAAATGTTACATTCATTTACTTAAATCAAATTAAGTCTGCTCACT 
CAGTCCCCCACAGTGACCAACTTATAAAAGAGAAGGTACATTTCAGTCAT 
CACTGAGGTTCTCTCTACCACTGGAAAACTGAGGAAGGGTCTGGAGTCCA 
CAGTGGTTAACATCATTGCCTCTGTTTTTTCTCCTACTCAATGTAACCAT 
CCAAGGTTACTCACAAATTCACAAAAAGAGGTCTTCACCTCTGCTCTCAA 
GACCCAGAGGGCTGGGTTCTAAACTCAAAGGCCAATGTTCCCCAACTTTT 
TGCATTGTTTCAACATTGGGGAAAACTCGAGGGGATTCAAGAATGGTTAT 
ATAAGTTTTGTGGAAAAATGTATAATTTTTTAAAATTAAAATACAAAGTA 
TTATGGAAAGCACTAAATATTGAATTTATATAAATATTCCAAATATTTTT 
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ZTAAATTTTTAGTGAGAAACTTGAGCTTGCTTCTGTGAGATATTTATTTT 

AAAACAGATTTGACACrrAAAATGTCTAATCAAGCCTTTTAAACCATGAT 

CTATCTCTTCAAATTCTTCAGATGCCACCATCAATAAAGAAACTTTGTTC 

ACACAAGTAAGTGGTAGCAAATGGCAGGGTG7TTATCATTTTTTTTTTTT 

CTTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGG 

CGCGATCTCAGCTCACTGCAAGTTCCACCTGCTGGGTTCACGCCCTTCTC 

CTCCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCACCACGCC 

CGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCC 

AGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCA 

AAGTGCTGGGATGACAGGCGTGAGCCACCGCGCCCCCGCCGTGTTTATCA 

TTTTTTGCCTGATGAAATTTTCCTTGCCACTACTCTGGATGGTTTGATAC 

ATTTAAATTGTGCTTCCAGGGTACAATTATCCTTTAAATCTATACCTCTT 

TCCTTTCTTTTATTGACAAATATAATGTTACACTTTTCTGTCATTGCAGC 

C AC AC CAC CAGTACACAG AT C C CAACAGAGTTGTAAT ATTTTATTAGTTT 

CAGAGTTTCAATATTTTATCACTTTCAATACTTCATGTGCAGGAGTTTTA 

TTTGGTACTTCTTTACAAAATAAATGATGTGCTTCCAAGCATrrCTTTTC 

AATAATTCC^ATCAATGTTATTAACTGAGTAATACTAGTATCTGTTTATT 

CATAAATTCACAGGAAATGCTTTTTTACTTATTAGTCTTTGGAATTCTGT 

TGTTTGTATAAACATC7TTCATGATGGCTTTGTGTCTACCAATAGCACTA 

TTGCCAAAAGGCACCTTTTTCTTGTTCCTTTACTTCACTGGTCCG^AGCC 

TGGTACCAACAACTACCACACAGACTGGGAAATGAGCAATTTTGCCACGT 

GCCCTTAGCTATTAATGGTGGCACTCCATAACTAGCATCTTAAGCTCAAT 

TTCATGAAAGAAATGTGTTTCTTATTTTGTACTTGCAGGCACTTTTTAAA 

CTTGTAATCTTTTATTCATACTTTAAAATTAAAACAGAGTAATAGAACCC 

ATAGAAGGAAATCAATACCCACGAGTCCATACTGATATAAATAAATAGTT 

ACATAAATAAATGGGGGGAGAAATAACAGCTCTTCCTTACAGAAAAATTT 

CAATTAATAAATGAAGAAGGAATTAGGGAAATACAACGTTACCATTAAGC 

AACCACAGTAATAATCATTACAGGCAATATCCAAAAATAAATTCCAAAGC 

CAGTGGGCAAAAGTTTGAGGAGATACAGGATATTAACATAGTCTCCAAAT 

AGCTCATGCTATTTATAAATTACAAAAGGAAACATAACAACTGTATAGTG 

AAGAAACTCAGCAGACACCACCTTAGCCAAGTGATCAAGGTTAACGTCAC 

TAGTAATAGGGCTTGTTGACATACTGGACTCCAATCTGATACACTGATAA 

GGACACATGACTTCTGCAGTATTCTTACCAAAAACAGAATTCTAATGTAA 

TTAAGGAAAATGTCAGACAAACCTATTCTGAGAAACATTCTATAAAACAA 

CTAACCAATACTTTCAAAATTGTCAAGGTCATAAAGACCAGGCGATGGTC 

ACAGATTTGAGGAGACTAAGGAGATACAACAACTAAATACACAAATGGAA 

C CATGGCATTCTTGATTGGATCTTGAAACAGAAAAAGGATATTAGGAAGA 

AAAGCTGATGAAATTCTAATACATTCTGTAGTTTAATTAATAGTATTGTA 

C CAATATTAATTTCCTAGATTTGATCATTATACTATGGTTAAGTTTTTAA 

CATTAGAGGAATCTGGGAGAATGGTATATATGAACTCCACTGTTCATTCA 

ACTTTTTCAGTAACTATTATTTCAAAATAAAGTT 

>Concig33 

GGGAGCGGCGGCCCACGCTGATCTCTAAAGCTTTAGACCACATTGGCTCG 
AGCATGGTCATGGCCGTTTCCTG 

>Contig34 

GACGTCTTAGCGCTATATTATAAAGAAATATTCACCTCCCTGCTGAGCTT 
ACAGGGTGTACCTAATGTCCAACAATATGAAATCTCTTCAATGAATTGCA 
GCACGTCCATATATAACCCACATGGAAGCTGTCCTCTTTCCTCACCTTCG 
AACTTCCCATGCCAAAGAGGGACCTCTTGGACTCAAATACATCTTAGCAA 
TATAGAAGATGCTGGAGACTTGTAGGAGAAGTGGAGAGGGTTTACAGTGT 
AGCCCCACAGAAAACAACTTATGACCCCATCAGTCACTTGTCCCTTTTTT 
C CATGCC7CAGTCTAGTCAGGAAACCACTAGATCCTGGATGGCTTCTTCT 
CCCTTCCCCTCCTTTCTCTTCTCCTCTCCCTCCCTTGCTCCTCCTTCCTC 
CATCACCCACTCCTTACTTCCAACCAAAACTTGACTAGCTCCAGTCTCAT 
CCCTCCTTATTGAAAACTATTTTACTCAGCCCTCCTCCCCCACTCCTGCC 
CAATCTTTATTCCTTACCTACATCAGACTTCACCAAAACAAAGGCCAGGA 
TAATAAACAGGACAAACTCTTTCAAACACATTTTAATGACCATATTTTGT 
TATTTTGGTACAATTTGAGGAGTCCCAATCCCCAGGGAAGACTAACAAGA 
AGTTCTCCTAACAAAGGTGGGTCTCCCC7TACTAAAAACTCCTGTAATGG 
CTGAAAAGAGCATGAGGTTTTCTGCATATCATTACACATTCAATAGAACG 
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TCATGCAGCTGTTAAAA^xGATCTGTAG.^GGC7ATCTTGTGACAGAAAo- 

GCATTGGAGATATACTGTTAGTGACAAAAATAGGTTATAAATGAATTTTT 

CCATGCATGCCTCTATATTTATAAATACACACACATAAAAGACAGGAAGG 

ACAGACATTAAACATTCATAGTGCTTAAGATGATGCATAGTATAATAGTT 

AGGACCATGGCCTTTGGGACAGAAAACTACAGCCTCTCTCCCACTTATCA 

GCCATGGGACCTTGGGCAATTTGCTCAGCCTCAAAGCCCCTGTTCCTTTA 

TCXGTGTGCTGGGGTTGTTGTAAGAGTTAAGTGCAATACACAGAGAGAGA 

GAGAGTACCTAACATGTATTATGTGCTCAGTCAATATGCATCATAGTACT 

CATTGTTACATATGTTCCTAAGTGCTTTATACGTTTTTTCCCTAAGTTGA 

CCATCTGTTTTTGGCATTATGAAACATAATGATCCTAACAAATTAAAATT 

AAAAACATAAAGAATATTTGCCCCAAAAAAATAAAGAACATGAATTCTTC 

AAGTAGCCAAGGGGCCATAGACAGAAGTAAGCCCTTGGTGGGGCTTAGTT 

GAGAGAAGTCTCCAGAAGGTCTTTCGTGTGTTAAAGAAGAGGGTAACAGG 

GAGGAGGTGGGGAGAGATGTTAACTGAGTCTAAATGAGCACCTGGAAGAA 

GAGATGGGACAGGCCACTTCTGCCTGGACTCCCTGATTGTTAAGAAGAAT 

GAAAAAGAGCAGAAGTCTTCCCTGAGCCCAACTTCACTCCCT5ACTTAAC 

CTAGTCTTTGCCCCTTCCCTCTCACTCATGGCTACTTTCTGTGGTCACCT 

TGTTGTAGAAATGGATGTGCAGCCACCTCATCTTTTTCTACCTCCTTCAC 

ATGTTTTAGATAATTTAATGTAGTAGAAGACGGTTACAGCAAAAAATTAC 

AAAAATCAAAATATCTCTGCTATCTACTGTTGCATTTCTAACCATCCCAA 

AACAGTAGCTGAAAACAGCACTCGTGGTCGAGCGCGGTGACTCATGCCTT 

TAATTC CAGATACTCCGGAGGCTGAGGCAAGAGAATCACTTGAACCCGGA 

AGGTGGAGG7TGCAGTGACTCAAGATCATGCCACTGCACTCCAGCCTGGG 

TGACACAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAGCACTCGTG 

TATTTTGTTCAAGATCTGTGGTTTGGGCAGGGCAGGGCTCAATGAGGACA 

TCTCGTCTCCGTTCCCGCAGTGTCAGGAAGTGTAACTGAGACTGGAGGGT 

CACACAGAAGATGGCTCCCTCAAGTGGCCAGCAAATTGGTGCTTACAATT 

GACAGGGAGCTGTTGACCAAGGGCCCCAATTCCTCTTCCTATGGCCCCTT 

CTCGGGCTGCATGGGCTTCTTTACAGAATGGCAGCTGGATTCCAAGAGCA 

AGTATCACAACCTACAGAAGAGTGGAGGAATATTGAAAGTTCACAGTCTC 

TTAAGACGTTGGCCCAGAAACTGGCAAAAGCTTCATTTCTGCCATGTTCT 

ATTGATCAGTCACAGAACCTGCACCAATTCAAGAGGAGAACATATAGAGG 

ACATCTCTCAATGGGATAAGTGTCAACAAATTTGCATCTATCACAATCTG 

TCTTTTGGGTACAAACTATTTCTATTCCTCCATTATGCAAAATATACTCA 

CAACCTCCCAGGGGTCGCAAAAGCCTCATCCATTTATGGCAAATGTGGCC 

CTTTTAATTTATATAAAATAATTTGCGGGGGCTTCCTTTATATTTTTAAC 

TCCCCTGC 

>Ccnci255 

GTGCAGAGAAGTGATTTAAAGCCCTTCAGAAAGAATGCTTTATTCCCGTG 

GAATTTGGTAACTTGCTTGGGTGTGGGGAGGTTTGTCAGCTTTCTCCACT 

CAAATTATCAGACCCTTTCCATTTAGTGGTAGACCATTTCCCTCGTCCAG 

GCCAAGGGCACATAGTACAGAGAAATAGGGAGTTGTTACCCAGGGAGAGA 

ACTTGGCTCTAAACCTGTAATAGAAAGGTCAGTTCTGGTCTGGAGGGTCA 

ATTTTGATCTTTGGCTCAQATCCAGGAATTGGAACCAAGGCTTTTGAACA 

TTTTAATGCAGGGGATTAAAAAAATGATACGAGTCATTCACGAATATATT 

TGCTTAACATCTAAAGAGATCCCTCAAAACACTAGAAAAAATAAG AACAA 

AAATCTAATAAAACAAAATTTGTTAAACACATTTACCAAATTTTTTTTCT 

TGGTAAAAATTCAAATGTCATAAATAAAGCTAAAGTTCCTCTTGATGACT 

CGCTCCTCTGCCCTATTCCACTCCAAGTAACCACTATTATCAGTCTTGCC 

AATACCCTTCCAGACCTCTCTACCTCTATATACCATTAGAAGCACATGGT 

TTTGCATTGAGGATGTGCAGTGTTTTGTTTTACGTAAATGTTATCACTCT 

GTTCTTGTTCCATAATTTGCCTTTTTCTCTCAATGATTTGCTTGGCTATC 

TTTCTATTTCAGTAGCATCTCCTTTCTTTTTAACTTACCATTGTTTATTT 

AACCTTGCCTCTATCAACAGATATGTAGGTTGTTTCTAGTT GATTT CATT 

AAGTATTTATAAACAACGCATCAGTAGATGTCCATAAATTTCTTTACGGA 

AGATGGCAAGTAGTGGAATTGCTGAGCCAAAGAACATGTTTAAAAAACCC 

AAAAAAACTAGACGCTACCAATTTTCTCTCCAAAATGGCCATACCCACTT 

ACCCATACAGAGATGATTTGGAATCTGGCTTCCTCACAAGGTGAGATGCC 

TTCACAGTTTCATTCTTCCTGGCATGTCTTCCCTTTTGTATCTGAGAGAG 

CTGGCAGAATTGTGTCACTAAATCAAGGATAGAGGGTCAAATGACAGCTC 
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AAGCTCACAGGCACCTCiGCTTTCTTCu^GACCACCTGCTTTCCTGCCa 

CCAGC7CTGTTCCATCTTATAGAATGGTTGCCACTTGGGTGTCTGCTCCG 

ACAGCCATGTCATCCTTTGCACTGCAGTTATGAAGCAGACAGAGCTAGGA 

GAGGGGCTTTGCCAGCC7CTGCCCTAGCTTGGAGAACTTCAAAAAAGGAG 

GGTATTGAAGTTGAACTCCCCCAAAAAGGGGTGGTCCCCACACCTCAAAA 

AGTGGTGCCTCCGAAAGAAATGTAAAATTCGTGTGGGGGGGGAAAAAGGT 

TAJTTAGAAATTGTTGGCTTGTCGTGCCGAAAGTATGTGTGGTTACGGGG 

AGTACGG AAATTTCGAGGGGTGGGGGCGAGGC CGTGTGTCCTTTAGC CCG 

GGGTTTTCCCGTCGCATGTTTAAGGGGGGGGAAGAGGGGGGATGTTTTCT 

TTCCGCGAAGG7TTTTGAAGAACGGCGTGG 

>Concig36 

CCCCCCACCGCCACTACTCAACCGGCCGTTCACGAAACAACTCGCCACAT 

CCACTAACCCGCTGGCTCACCACCCACCGCCCTCCCGATCCCCCCAATCC 

AAACTCAACCCCCACCACCAAGCGCCTCCCCCCTCCCCCACCCTCCAGCT 

CAGCCCCAACCTACCACCAACCCCGACTCGCCCACCGAAAACCAACAGCA 

AACCCAAATGCCCACAAAACCAGTGTCCAAACCCTCCTTCCGATCAGTTT 

GGTGGGCCCATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTT 

GGAGCAGCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTG 

TTGTGTTCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGA 

AAACACTCTAGTGGTTCCTCATTGCTCATTAATGAAAGTCTAGATATTAA 

ACGTAGAAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATC 

TTTTCTCCTTACTCTGACAGACTCTCCGTCTGTCATTTATGTATTCTTTT 

ATTGCTCTCTTCTACTTTTAGTATGAACTGGATTTATGGATTTTTTTAAC 

ATTGCTTTCAAGTATGGAATAAAGAATTTTATTTATTTATTTATTTATT7 

ATTTGAGACTGGGTCTCACTCTGTTGCCCAGGCCAGAATGCAATGGTGCA 

GTCATATCTCACTGTAACCTCGAATTCCTAGGCTCAAGCCATCCTCCTGC 

CTCAGCCTCCTAAGTAGCTATGACTACGGGTGTGCATCACCACATCTGGC 

TAATGGAATAAAATATTACAATGCCTAATCTTAATTTTCAAAATTTTAAA 

TTACATTGTACCTAATGCCCATGCATTTACTTTTTTCAGTGGGTCAATAG 

CCCTCACTTTGGCAAAGGTCCCAGGCCCAAGGTAAGGCCTTACTTTTTCC 

AAACTCATCTTTTGAAAGACATAAGTGCCTGTAAGTTGTACCACATTAGG 

TTCTAGGAATTTTTCATCAAAGACTTTATCAGACTATTTTCCTCTAAGTT 

GAGAAAGAGCTGGGGGCAGAATATGGCACTGAATGACTGAAGAGAAGGCA 

CTGAAATCAGGCCAGAGGTTGCTGGAAAGAGCAATGAGGAACACCAGCAG 

CAATGAGGAGCCGGTGATGATTTTGGCTTCACAGGGAGGTGTGTACCACA 

CCGATTTTATCTCTACGTGGATGAACCACAGCTGTCGGCTCCCTTGTCTC 

CAGGACATCACACTCTCCACATTCCCTCCCATC7TCCGGCTTCTGCTTCC 

CGGGGCCCTCATCTGCCCCATCCTGGGTGAACACTGGTCGGTCAACTGCT 

GGGCGTACCTTCCCGCTCTGCACACCCTCCCTGGCCACCCCACCCACTCT 

CACGGCTCGCACTGCAGAGGAGCCGCATCTCTAGCTCCAGCCCATCTGCC 

TCTTCTGAGCTCTAACTTCATGTAGGCGACTCCTGCCGGTGTTGCCTCAC 

AGGCCCATCATACTTCAAAGCATTTTCCCCTCAGAACACCATGTCCTGGC 

TGCTCCCTCCAGAAGATACATCTCTCAAGCACATCCCCGCGGCTCTCACC 

TGGATGACTGCATTCACCTTCTCCCACATTTGCCCCCTCCTTTGGATGTA 

TATAGATTGTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAA 

ACACCTCAAAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACA 

GGTTGCCCATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCC 

CTTGCCCCTCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTC 

TCTAACACACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTC 

CCATTGCCTGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCAT 

CTGATTAATCCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTC 

TCTCTGACTTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAG 

CACTCTGTATTACTCCTTTCTTAATGACCACCTGCTGTAGACAGAATGTT 

TGTCTTCCTCCAAAATCATATGTAAAACCTTCCACCAGAGCGATGATTAG 

AGAAGCCTCCC 

>Contig37 

GACTGACATTCAGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAA 
CCATGTCTTTACTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCAC 
ATAATTTCTTCCACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGCT 
TACTGTCTAATATCTTGGCCCCTTAAAAGTTGCTAAT.CTTCCAAACCTCA 
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TTCCTGTGACTGGGCCLiwTGGTCCTTG^-CATGGGCCTTGAAGATACIwm 

C7G7ACAC77A7C7GGAGCA7CCAG7GCC7ACCACC7GACCCAGA77CC7 

CA77GCGC7CC7CCC7CC7CCACC7AA7GGGA7T7GC7CA7ACCCG7G7G 

GGACCCCTCCCATTTTCCCCAACTGAATACTTATCAAGACAACGCATTGC 

CATACTCCCTCGTACCCTGCTCTGGGCATCAGACTGAATGTTTGTTTCCA 

77GAGGA7C7GCAGCTGCA7CAG777CCCCAGCACCG7CCAACCCC77GA 

GCATGGCTAGTCCTAAAGCAGAGAATTAGCCTTTCTATCCCTGCTGCTAT 

ACATGCTGGGAOUUVTAATAAGAAATGAC^GCATTTTATGATAATGCAGG 

C7GCAGGAGGCAGGAGGCAGGAA7CAAA7TCG7GC77A7CAAA7AG7GC7 

CCAATTCTTTGAATATTGGACTATAGAATATGTCATGGATCTATGCTCAG 

GTGGGTTCCCTATTACTCACTCCACTGAGGCCAGGTTGTGGGATTAGCTG 

TCCAAGAGGGAGTTTCAGTCTCACAGCATAGGGTCATTCTGAGAATTACT 

GGCCCACACTTGTGTGGAGACCTCCAGAGAACAGAATCTGGGTTGGTGCC 

ATGTACTTCCAGGAGGAGAGAAGTGGCAGGATGCCCAGCCCCACAATCAG 

AGGGGAAGGGGCAGAGCCACATGTATGAAGATCCTCTCCCCAGTACGTGC 

CAATCACAGGGCTTCCTAGCTTTTGGGCCAAGGAAACAATGTGGGAAGCA 

AAAAAGGACAATTTTCTCCTCCCTTTGCATGAAGACTGAGCAGTTTTACC 

AGATTCCCAGGGAAACACCCTTCCACTCTGGGTTGAATGTGAGTGAGAGA 

CATTCAGCTGGAACACTAGAAAAACTATTTCCTGAGCCACTCACCTTTAG 

CCCTAGAAAGTGTTGGATTTGTCCTTCATCTTTGCCACAGTAGAGACTGC 

TGATAGCATCAGAACTTGGGCTCTGGAATTAGACAGATATGGGTACAAAT 

CTGAGCTCTCTCACTTATTAGTGTGGGATGTAGAGCAACTTTTAAAATCC 

TTCCAAACCTCAGACTTCTCATGCATGATG7GAGGATTGTAATAGGGCCC 

ACCTAATAGGGGTTTTTGAGAATTAAAAAAGTTATTCAATGAACAGCATT 

TAGCAAGATGCCTGACCATTGAGAAAATAACAAATTGTTTATTATTATTG 

7TA77A77AAACA7CT7TCC7GCACCTTC7GAC7GGGGGCA7CG7ATCA7 

CAGAAATACTTAGGATGGGATGGATTCCTGCATGGGCTGAGTCAAGGGTG 

CAATAATGGAGGAGTGAAGAAGGAAGAAATGGAGGCAGAAATCCCCAGGA 

GCCCAGCATGGTACAAGGCTGAGCTAGTGCTGCAGAGCCTCCTTGGAACA 

GCCACAGAGCTTGCATCTGGCCCTGGAGGAACCTCTTCTAGCTGGCAGGA 

CCAGCCACAACAGTGGCCAGGGGATTTCCCAGGGCGTGGGCTCCTCAGGA 

GTTCATTTGGACCAAGCCTGCCTGGAGAGGGGTTATAACAGGGATCCTTC 

CCTACTGGCAGGTGATTTACCCCTCGGTGAGAAGCTCAGGCATTTGTTTG 

ATGGAAGGTGGAAGGCCCTGTGCTGGGCCAGTGACTATCAGGGATGGGCG 

GGTGGCTGGAAAATAGCAAATAAGACAATATGATAACACAGTTAACCACC 

ACACTATGTGAAGCTACAATATGGGTATCTGTAATAGACAATTCCAATGT 

AGAGAATAATTTTAAGGTGTCATTCTCCCCGCCAATGCCATAAGCACACG 

GCCTCTGCCTGGGTTTCTCACTGTGGAATGTCCTCCTGGTCTCCTCATGC 

3CAGAGAGTGGGAAGTACTCCTACTTTAACACCGGCTTTCCTGTCATTTC 

CNTGCAGCCCTCCTCAGCCCCCTCTGCACAGGGAGGTTTCCTCCCTGCTG 

CTGCAGTGCTTTGTACTTGTTAGTGGTACCTGCACACAGGTATTGGTGTC 

CTTGTCTCACCACCCTACATCACTGTAAGCTCCCCAGGAGCAGGCTTCCT 

GTTTGACTCACCTGTGATCCTCCACCTCCCACCCTGTAGTGCCTCAAGCA 

TTCTGTAGAGCACATGGACGCC 

>Concig38 .. 

GACTAATAAGTACTTCATTATTTGGGTATTTTCCAAGAACAACATATTGT 
AGGAAACCATTCTTTCTAAAAAAAAAAGTGTCCTTTTAAAAAGGTGAATA 
ATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTATGTATAAAACAAGGTA 
AAAGGAACAAGGAAATAAGGGAAATGTAAAGAAAATTATAGAAATAAAGT 
GGTATTTTTTGGTAAGAAAGCTTAAAGAGAAATAATTTTAGGTAAGAAAG 
AATCTTACCTAAAATTTTGTGCTAGAATAAAGTGACTGGCTAAGAAAGGG 
ATGTTCAAAGCTATTTATGACAAACCCACAGCCAATATCATACTGAATGG 
GCAAAAGCTGGAAACATTCCCTTTGAGAACTGGCACAAGACAAGGATGTC 
CTCTCTCACCACTCCTATTCAACATAGTATCGGAAGTTCTGGCCAGGGCA 
ATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAATAGGAAGAGAGGAAGT 
CAAATTTTCTCCGTTTGCAGATGCATGATTGCATATTTAGAAAACCCCAT 
CATTTCAGCCCCAAAACTCCTTAAGCTGATAAGCAACTTCAGCAAAG7CT 
CAGGATACAAAATCAATGTGCAAAAATCACAGGCATTCCTATACACCAAT 
AATAGAC7AACAGAGAGCCAAATCATGAGTGAACTCCCATTCACAATTGC 
7ACAAAGAGAATAAAATACCTGGGAATACAAC7TACAATGGACATGAAAG 
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AC CTT7T CAGGGTGAAv. . GCAAACCAC* ^CTCAAGGAAATAAGAGAGu^A 

ACAAACAAATGGAAAAACATTCCATGCTTATGGATAGGAAGAATCAATAT 

CGTGAAAATGGCCATACTGCCCAAGTAATTTATAGATTCAATGCTATCCC 

CATCAAGCTACCATTGACTTTCTTCACAGAATTAGAAAAAACTAATAGCC 

AAGACAATCCTAAGCAAAAAGAACAAAGCTGGAGGCATTG TGCTACCTGA 

CTTCAAACTATACTACAAGGCTGCAGTAACCAAAACAGCATGGTACTGGT 

ACCAAAACAGATATATAGACCAAAAGAACAGAACAGAGGCCTCAGATATA 

ACXCCACACATCTACAACCATCTGATCTTTGACAAACCTAACAAAAATAA 

GCAATGGGGAAAATAATTCCCTATTTAATAAATGATGTTGGGAAAACTGG 

TTAGCCATATGCTGAAAACTGAAACTGGACCCCTTCCTTACAACTTATAC 

.AAPlAATCAACTCAAGATGGATTAAAGATTTAAACATGGCTGGGCATGGTG 

GCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGATGGGTGGATCAT 

GAGGTCAGGAGATGGAGACCATCCTGACTAACACAGTGAAACCCTGTCTC 

TACTAAAAAATACAAAAAATTAGCTGGGCATGGTGGTGGGCGCCTGTAAT 

CCCAGCTACTTGGGAAGCTAAGGCAGGAGAATGGTGTGAACCCAGGAAGT 

GGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTCTAGCCTGGGCAAC 

AGAGTGAGACTCCATCTCAATAAATAAATAAATATGGAACTCTCCCAACA 

CAATAATAAGACAAACCCCCAAATGTTTTAAATGGGCAAAAATATTTGAA 

CAGACACTTCACAAAAGAGGATATGTAAATGGTCAAAAAGCACATGAAAA 

GATGTTCAACACCATTGGTCATCAGGGCAAAGAAAACTAGAACCACAATG 

AGATG C CTC TGTACAC CACTTAAATGTC CAAATTAAAG AAAACAAGTTTT 

GGCAAAGTTGTGGAGCAACTGAAATGCTCGTGTATTGCTGGTAGAAAAAC 

AAAATGGCATAACCATCGCAGATAATTTGTTGTCAGTTTCTTACAAAGTT 

AAACATA7ACTTATTGATATGACAGTTCCATTCCAAGAGAAATGAAAACA 

TAAGTCCACACAAAGACTTGTACCTGGGTGTTCATGGTAGCTCTATTCAT 

AATTGCCAAAATCTGGAAACAAATCAAATGTCCATCAGCAATGGAATGGA 

TATACAAATTGTGGTACACATGTACAATAGAAAACTACTCTGCAATGGAG 

AGAAATTAACCATTGACAAACACAAAAACATGGACAAACCTCAAAAACAT 

TATGCTGAGCAAAAGAAGCCAGACACAAAAGACTGCTCAGCGCATGATTC 

CATTCATATGAAATCACAGAAAGGGTCAGTTGAAGGTGCAGAGACAAAAA 

GTAGATCTGCAGTTGCCTGGGGATGGGGTGGGAGGTTGACTGCTCTGACG 

CGTAAGGAAATTTGGGGGTAGGTGGGGGATGGTGGGAATATTTTTTGAAT 

TGAATTGGGTAATAGTTTTAATAGGTAAAATATTGGACCCCACAGTATTT 

GAGATAGG7TTCAGTCAATTTAGACAGTTTATTTTGCCAAGGTTAAGGAT 

GCATCCGTGACCCAGCCTCAGGAGGTCCTGACAACCTGTGCTGAAGGCAG 

TCAACATACAGCTTGCTTTTATTCATCTTAGGGAGACATAATACATCAAT 

CAATGCATGTAAGGTTTACATTGGTTCAATCTGGAAAGGTGAGGGAACTT 

GAAGCAGGGAGCTTCCAGGTTACAAGGTAGATTATTCTCAACAGAAAGGA 

ATGTCTGGGTTATGATAAGCGGTTGTGGAGACCAAGGTTTTATCT7GTAG 

ATGAAGC CT CCGGGTAGCAAGCTTCAGAGGGAATAGATTGTCAAAGTTTC 

CTATCAGACATAAGGTCTGTGTTGATGTTAATGCTGGTCAGCTTTTCCTG 

AATTCCAAAAGGGAGAAGGGTATACTGGGGCATGTCCAACCTTCCCTTCC 

ATCATGACCTGAACTAG I 1"! 1"! 1 CAGGTTAACTTTGGAATGCTCTTGGCC 

AAGAAGAGGGGTCCATTCAGATGGTTGGGGGGGCTTAGAATTTTATTTTT 

GGTTTACAGTGAAGACTTTTCAAGCTAGACACTTAAATGAGTATGTTGCA 

AAATGGCAATTTCTTAGCACGGC 

>Concig39 

GACGTCCTAAAGAAATGCTAAGGTAACTCAATTAACTATGCTAGAAAAGA 
GAGTTAAGTATTTAGGAGGATTTAATATGGTGTTAAAGTTGTGAAAATCA 
AAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAGCCAGGGAAG 
GCCATGAAGAAAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAGACT 
CTGCAAAAAACAAAACCTTGCACAAAGGCCATTGCAACC7TACACAAAAA 
ATACTACTTTAAAAGGACATGTGCCCAGCAACTGCCTGTCCAACCTCAGA 
CTGGCAATATCTTTGTTATTGATCTTAGTAGCCCAGCATAACTATTTCAA 
AACAGTGATGTAATGCTCATTTTTTTTCTTTTGAAAACTTTTGTCCTCCT 
3TAAAAACCTTTGTCTTCTTTACTTACCCTGAATATGCACAGAGTTTACT 
ATGGAGTGCATATTCCTGTTGCAATGCTCTATTCCCAAACAAACATCATT 
TTCTTTTAGAGAGCCTCTCTCTGTTTGTGATTTAGGTTGGTGATGTAAAG 
CAATGGCATAACTGAACACTGATTCAAAGAAAAGTGGCTTTTCTCTTTGT 
TGTATTAAAAAGAGGCCTTATAAATAGGATAGTAAGATTTGTAAGTTGAA 
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CTTAAAGCATGAAGAAiuVTTTAGGGGCCAGGCAGGGTGGCTCACTCCTuT 

AA7CCCAGCAC777GGGAGGCCAAGACAGGAGGA77GCTTGAGCCCAGGA 

G77CAAGACCAG7C7GG7CAACACAGACC7CA7CT77ACTAAAAA7AAAA 

AAATTAGGCCAGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 

GGCCAAGGCGGGAGGA7CAC77GAGG7CAGGAG77CG7GACCAGCC7GG7 

CAACACGATGAAACC C CA7CTCTAC7AAAAATACAAAAAAA77AGC7GGG 

707GG7GGCGGGCACC7GCAATCCCAGC7ACTCGGGAGGCT7CAGGCAGG 

GGAATCACTTGAACCTGGGAGGCGGACATTGCAGTGAGCTGAGATAGTCC 

CAC7GCACTCCAGCC7GGGCGAC7CAGCAAGAC7C7GCC7CAAAAAAAAA 

AAAAAAAATTAGTCAGGTGTGGTAGCACACAGCTGTGGTCCCAGCTACTC 

GGGAGGC7GAGG7GGGAGGATCA7C7GAGCCCAGGAGG7CAAGGC7GCGG 

TAAGAGCTGAGATTGTACTACTGCATTCCAGCAGGGGCTACAAAGTGAGA 

CCCTGTCTCAAAAAAAGAAAAAGAAAAAGAAAATTATGTTTTTAAATTTA 

TAATTATAATAAATTTAATTACATAAATTTAAGCTCAAGTAATTGTAAAT 

A77C777C7G7GCACATAAGTTAT7C77G7AT7GACCCCACAGGAGC7GG 

CCATTCTTCAAGTCAGAAGGCCTGAGAGAGGAGCTGCCCAGGTGGTCTTC 

ATGGGGC7G7GCGGCCAGTCATCCCCCACAGG7TGACAATCC77G7GTAC 

TTCA7CC7CG77GGATCCTC7G7A7CCC7GACGA7GAGCAAC7GTGAGGC 

CCGTTTCAGCACTGAGTTCCAGTCAGGAAAACATCCACCCACCCACCACA 

CGCTCACACTTACACACACATTCACACATGCACACACGTTCTGGCTCCGA 

AAAAGAAAAAAAAAAAGCAATTTAAAATAATTCTGATCCTTTGCTTATTT 

CCACAAAC7CCA7GAAAA77GTACAT7GTCCAAGCAACA7T7CT7AA7A7 

TCTCTTTTTCTCTCATATCCATTTTCCTTACTGCTGTCTCCACCTTTCTC 

T7CCAAAC7CCC7G7TAAAA7CCCTGCCCCAGCGAACTT77A77CAATT7 

TGTGGAATGGAGGCTGCTCTGATTTAAATTAAAAAAAAAAAAAAAATCCC 

TACTCCATGTCCCAGATCCCTAGTTGTTTrTTGTTTTTTGTTTTCCTGAG 

ACAGGGTCTTGTGTCTTCCATGCTGGAGTGCAGTGGCATGATCATGGCTC 

ACTGCAGCCTCAACCTCCTGGGCTCAAGTAAATCTCTTGCGTCAGCCCTC 

CCCAGTAGCTGGGAGrrCAGGTATGTGCTACCATGCCTAGCTAATTTTTT 

TCTTTTATTTTGTAGAGACACGGTCTTGCCAGGTTGCCCAGGCTGGTATA 

GAACCCCTGGGCTTAAGTGATCCTCCTGCCTCGGCTTCCCAAAGTGCTGG 

GATTACAAGTGTGAGGCACTGCACCCAGGCTGGATCCCTGCATTTTTACA 

GATTTAGCATCACAAAAGTCTAAACAATTAGACTGACTAAGGCAGAACTG 

CCCTTATGACAGCAGACATAAGAAGGAAAAGGCCAAAACACTGTGTTAAA 

AATT AT C C AAATGTGAGGAAAAGGCAAAGAGAGTAGGTGTGCC7TTTTAG 

TGTCTAAGCTGCCTGCCCAAGGGGCATCTGATGCTCTCAGGCAGGAGTCC 

ACAAA77777T7T7GTAAAAGATCAGATAG7AAA7C77T7CAGCG7GAAG 

AGCA7GAGG7C7C7G7CACAAATAC7CAACCACCATTACAACA7GAAAGC 

AGCCAACAGACAACACATGACAAATGAGTGTGGCTGTGTTCCAGTAAATC 

TTGATTACAAAAACAGGCAAGAGGCCAGAGCTGACCCATGGGCCATAGTT 

TGCTGACCCCTTCTGTAAAGGAAAGTATTTTTGT7TGACTTGCTGTTTAC 

CATTGATTGAACACAAGGCTCTGTAGAGTTACTTGTTAACTTGCAGAAGA 

TTGATGAG7GGCAAGTAATTTTTATTCACCAGAATATANNATTATTCTGT 

TCAG7AGATAAGATAAACCCACTGTTATATTACTGTCTTGTTTAGAATGT 

GACT7TGATTCATTTTTTCACAAATTCATATTATTGCCC7AA7TTGTATA 

7AAG7A7GCTTCTTTTAAAAATATATATTTTT7AATAAATTTGAGACAGG 

GTCTCAC7AGGTTGCCCAGCCTTTTGCTATAATGAGAGCATAAAGTGAA7 

TTCACAC7T7AGCCTAGTGCATAGATGGGA7TACAGGCACAAACCACTGC 

ATGCAGC7AAC7T7GCTTCTCATTCCAGCACG7TCTATTCCNNNGNTTTT 

CATA7ACGCGTCTCTTAATGC 

>Concia40 

CGCAT7CAGCCCAAGTTT7CTTCAG7GT7AAGGT7TTTGT7AC7C7G7GC 
CCAAA7G7CC7TCCAAAAAGGTTAAGTTTTTTTACCT7CCTGCCAACATT 
ATA7GAAAG7G7CCACTTTTG7AGACT7T7ACCAATGC7GAC7AC TTTTG 
G7TTC^AAAAAGCTCTCAGTAATrTTCTATTAA77ACTT7TACCCTTTTT 
7ATTGAGGG7G77CAACTTTT7ATTG7TAGCATATTCTCTCTGGGCTCCA 
77GGAC3CC77GGCAGCTTTTTGGTAG7AGGTGCCTTTAGAAAAGTCCTT 
C7CG7C7GGCCC7TTCTGAGCAAATC7AG7GAACAGAA77GGCTCCATGC 
7CAGCA77GC7TAATACGGTTGATCCAGGGCC7AGGAC7CATTCC77CA7 
TACCA7CCAC77GCA7TGTCT7AAAGCAAGGC7C7A7TAA7T7AAT77GG 
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CATTTCCTGTCCCAGCTCrfTAGTTTCA^ 

TCCCAGTAGATGCCTATGTTGCTTCCTTTTAAAAAATTTTGGAGCTGTTT 

CCCTAGCCTAACCTTTTCTTCAGGGCAGGAGTTAAGTCCCTTCTACTGCA 

TTCCTGTGAAGATGGTGATTCAAGAGGCAGGGCACCTGTTGCTTTGTGAA 

ACAGTCCACTCTGCAGCTGGGCAGCTCTGTTACTAGAATGTTCTCCCTTC 

TGGGGAGCCAATATTTTGATGTCCTCTGTGAATCTCATCTGCTTATCCCA 

TCSGTTTATGTCCTTGAAGATGCACAGGTCTGACACCACGAGGTAGCCCT 

TAGAAA7TTGATGGCATTTCTGATGTGTCCCCAACTCTTCTCCAACCACT 

CCTCCCAGAGCT7GTTTCTTAAGCCCCTTGTGGAGCTGATTGCTTTCCTC 

AAGGCAGCTCAGTTTTTCCCAGTTTGCTCCTGGTGGTCCTGAAATATGAT 

TGACTCCTGAATACTCCAGGTGTGAAGGAGAGTGGGGGTGGCCTTTCTAC 

TTGTCATGGCCTGGGTTTTAAGTTGCTGTCCAGTGGAGCAGAGGTGACTT 

TCCCAGTGAACTACATTTTTTCCCCTCTAAATCCTTAGCAATTTTGTCTC 

CAGAGGCAAGACCTGGCCAAACCATTTGTGTTGAGGATTGAATCAAGAAT 

GATTGAGGAGATGACAGTAGTCCCCCCTCATCTGAGGAGGGCGTGTTCCA 

AGCCCCTCAGTGAATGCCTGAAACTGTGGATAGTACCCAACTCTATATGT 

CTATGATTTTCCTATAAATTAA7ACATGCCTGTGACAATGTTTAATT7AT 

AAATTAGGCAAAGAGGCCAGGCGCAGTGGCTCAAGCCTGTAATCCCAGCA 

CTTTAGGAGGCTGAGGCCTCACCTGAGGTCAGGAGTTCGAGACCAGCCTG 

ACCAACATGGAGAAACCCCGCCTCTACTAAAAATACAAAATTAGCTGGGC 

ATGG7GGCAGGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAG 

AATCACTTGAACCCGGGAGGCGGGATTTGCGGTGAGCTGAGATCGTCTCA 

TTGCACTACAGCCTGGGCAACAAGAGTGAAACTCCGACTCAAAAAAAAAA 

AAAAAAATTAGGCAAAGAAAGAAATTAACAACAATAAGTAATGAAATAGA 

ACAATTCTAACAATATACTATAATAAAAGTTGTATGAATGTGGTCTCTTT 

CTCAAAATTACCTTTTTTTTTTGAGACAGGGTCTCACTTTATTGCCCAGG 

CTGGAGTGCAGTGGCACGATCACAGCTTACTGCTGCCTCGACCTCCTGGG 

ACCAAGTGATCCTCCCACTTTAGCCTCCTGAGTAGCTGGGACCACAGGCA 

TGCACCACTGTATCTGGATAATTTTGTTTATTTTTTTTTGCAGAGAGAGG 

AGGTCTCACTATGTTTCCCAGGCTGGTTTTGAATGCCTGGGCCCAAGGGA 

TCCTCCTGCCTTGGCCTCCCAAAGTATTGGGATTACAAGCGTGAGCCACC 

ATGCCTGCCCCAAAATTATCTTATTGTTCTATACCCACTCTTCTTCTTGT 

GATGATGTGAGGTGATCCATTGCCTCCTTGATGAGATGAAGTGAGGTGAC 

TGATGTGGGCATAGTGATGCAGTGTTTAGGCTGATATTGGCCTGATGATA 

TGTCAGAAGGAGGGTCATCTGCTTCGGTGATCCTGGATCATAGAGTCATG 

ATGATGTCAATGGTTGGATGTCAGGAGCAGACGATGTCAATGACTAACGA 

TAAGCTGGACAGGTGGGATGGTGGCACAAGATTTTATCACGCTACTCAGA 

ATGGAGCACAATTTAAAACTTCTGAATTGTTrATTTTTGGAATTTTTCAT 

TAATATTTTTGGATTGCAGTTGACTGTGGGTAACTGAAACTGTGGAATGT 

GAGACTGTGGAAAAGTGAGGGAGTACTGTATTATGGAACTGTAACTCTAT 

TCGGTAGGGGAACAGAATTCACATTTGTGGGGCCCAGGTCTCTGCATCTG 

TAGGGATCCAATTGTTTCATTTCTCGTTGTAGCAAAAACTTGGCTTTGGA 

ATCAGACAGATTGATGTTTGCTATCATTCTAAATGGGTGCAGCTACACTT 

TCCTCAAGAGGTAGTTCTGAAAATTTAACAAAATGTGAATTTCTTGGTAA 

AAAAAAAAAACCTCAAAAATATTCAGTTTCCTTTCCTTTGTGTCTGATGT 

ACTCCATCAAATACTGGGAAATATGTGTCTCTCATAGAAATGTCATGGAT 

CTTTGTAATTCTGATTATCCACAAACCTTGGGGATTAGCTGTTTCAATGT 

TCCTATTTTACAGATAAGAAAATGGAGCCTGTGGTAAGTTAAGTGAGTTA 

CTCATGGCTACTTAACTAATATTTTACTAGGTGATAGGCCAGAGCTAGAG 

CCCAGGTCACCTTCTTATCAATGCTCTGCCTTGTCTCTGTGCCTTCCTGT 

CTGTCTGTATGTGTATGTGCCTGTTGACAGTAAGGCATAGTTTAACCCAG 

TAGAACTACCGGTTTGTAATGAATTCCACTTGT AAAT GACTGACCATTCA 

AGGAACAAGTGTTTTTTCTATGCTTGACACCTGTTTTGGATGCCAAAAAG 

GATACAAATGTAACTTCAGACACTCTGGGCCTCATTTTGCACTCATTAGC 

ATGTCCAAAATTAAAAAGACTGACCACACCAAATATTGGTGAGGATGTGG 

AAGAACGGGAACTTTCATACACTGCTGGTGGGGATGTAAAATGGTACAAT 

C CCTTTGGGTAACAGTTTGACAGTTTCTTAAAAAGTTAGACATATATATT 

TACCATATGACTCAGCCCTTCCACTTCTAGGTCTTTACCCAAGAGAAATG 

AAATGCTGTGCTTTTACAAATGTCTATACAGGAATGTACATAGCAACCTT 

ATTTGTCATTGCAAAAAACAGAGACAATTCAACGTTGTCAAGAGTGAATG 
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GA7GAGCAAGC7G7GG7. "JTCTATGCAr.* GG7A7CC7AC7CAGCCAG; 

AAAGA7A7GGC7AA7 

>Concig41 

GACAACAA7G7CA7GCA7AAGA7GACGA7GGCC7GGG7GA77GA7GCAAA 

CAAGGA7AAAGAAAATAA7CAA77T7G7CC CCA77T7CAAAGACAGA7AG 

CAGCAGCAAGAGTGTAAGTCTGAGGAAAGTCATATTCCTTCCTCCTACAA 

CA*5AGCACACACACTTACAAAAACAATACACAGACTCCTGGCCAATGGAC 

TTCAAAACTGAGGAGGATCATTAAATTTAAATGT7CACCGCTGCATGAAA 

TCTCCCTGGGTCCTGCCCTCCCTTCCCCACCCTCCTCCACTTGGGCCGGG 

GCACAGCAGTGATTCTCTCACCTCTCAGAGTGAGCCAGTGTTGGCTGCAT 

TGAAGGCTCCAGATATGCAAACAGGGCAGATATTCCTGGACCAGGGTGCA 

CAGAGTGAGGCTCCAACGCACCCTATTAACTGCATGAAGGATGAATGAGC 

CTCTGGTATGGGCTGGGACAGAAAAAAGGATTCAAGGGGCCCAAAAGGGT 

TTGGGTGGAACCTACCAGGAGCGGCAGTACAGACTCCTTGGGAAGGTGGC 

CATGATTTAGCCACATTCACCAATAGGATAATCTGGAGAATTTCCTAGC7 

TGAGTTTCTGGGAGAAAGCAGATTTCTGGATTATCTGGTGACAGGTAACA 

GGGCCGAGTTCATCCACAGCCACCTGCAGTGTTAGCACCTTAAGCTGAGT 

TCCTTGCACCAGGATGCTGTCACGCCCAGTCAGTGTGAGACGGTTCTTGG 

CTGAAGGAC7GAAAAGCTTGGGTAAGTGACTTCACCTAAGCCTCTATCTC 

T7GC7CCCG7AAG7CAGGGC7CA77G7GGC7CC77GCAGGC7TGAC77CA 

GGGTTAACAGAGAAAATGAAGG7ACAAG7GCCT7G7GAAC7CTGAAAC7C 

CAAACCAG7CA7TCTCAAAGTGCCGTCCACCAG7C7AGCACA7CAGCA7C 

AC7GGAAGC77G777GAAA7G7AAA77A7CAGG7CC7CCAGAGC7A7G7A 

~GAA77AGAAACTCTGGGAA7GGGGCCCTGCAA7C7A7TTCAACAGGTCC 

7CCAGG7GAT7CTGA7GCAAGTTAAAGCC7GAGAAAC7C7G7CC7ATACA 

AA7GGA7G7CAAC7CAAGCTGCTC77CAGAA7CACC7A7AGCAC7TG77C 

ACCCGAA7CCC7GAGAATGGAGC7TCAGGAC7GCTA7TTC7CAAAGT7TG 

CC7GG7GA7CC7GAGATGGGG7T7GGGGGACAGAGATCCAAGGTGCTACC 

AGGTGTGAGGAATTG7TAGAAGGCAAACCTGGCTGTCATCTAGGGTGCTT 

AAAGGG7ACAGATCC7AGGAT7CTGCCTCTTACAGC7GAATCAGACTTTC 

C m AGAA7GGGATTGC7G7CCAATGGCATGCC7CC7GGGTGACTCTGATG7 

A7AGCC7GGGC7GGGAACCACCAGAGGA7TA7CTTCCAT7GACCAAGCTG 

ACAAAC7CGCT7AAGGCTCTGAGTTTCACACTTGATTTTCTAGCCCCTGT 

CC7TCCA7GGA7CACCTGCCCCC7TCCCTCCTAATCAGGAGCACAGTCAG 

" P GGA7GCAC7AA7GTGGCCTCTCCTTGGCTGCAGGGAACAGGTGGAAATG 

7GGCCA7AGG7GTGCAGGGCTGCCTGCCATG7A7TAATAGCTACAGAT7T 

GAAAGA7CCAAGGACAAGAGACTAGAAAAAAATTTAAAACAGCCAAGCA7 

7GGCCCAG7AA7GGCATT7CAGAAA7CCACCAAAA7ATTAAGATGC7777 

- , GAAAAA7A7CCAGAGCACTCATGTAAAAGTGCTTAATTATTAA7AAAAG 

C7GACA7G7G77GGGTACTTCCTGTGGGTC7GGCAC7AGGCTAATTATG7 

"777AGGAG77GACTCAAATGCTCCCTGTCATAAT7ATGTGAAAAAATAT 

AA77ATTAGCTCCATGGTACAAATTAAGGAGAGGTTACATAAATAAAAAG 

GAATGA7AC7CAAAT7AGTAACCAGAGCCCATGCTCTTAAACACTATGCT 

A7TA777G7GGAC7CTTACATAGGTGGCAAAAGTCAAAGGCTAGATTGAC 

77C7G7CCACTTCCAGCCAAGATGAAGTACAAGATTCAGATACACCCTTC 

CGCA77AAACAACTTAGGAATCAGACAAAATATACAAAGCATTGTTTGTT 

ACACA77GGATAACAGACAGCACTAGATAGTCGTGTCTGAGAAAAGCGGT 

GAAA7GAGCTGAGTCT7AGAATTGCCCCAGTTTAC7AAGGGGCATAGTAA 

GGGCA7AGCTGCAGCACAAAGAAGCAGAACCCAACAGAGACTGGCGTTCA 

CCTGAG77GAGAAAACCAAGTTGAAAATTTAGGAACACTAACACAGATAT 

G7AGGCAAGAGTATCAGAGAGGAGACAGTTGTAGGGAAAAAGAGAGCTT7 

ACAGAGAGACAGCGAGAGC7CCAGAGACCCGCAGAAGATTGCCCTGACG7 

CAC7AGC7GAGTACCGATCAGTGCATACATGTAAGGATATTACTCAATA7 

G7GGAAAAGAACAGAAGGAATGATGTCCAAAGCTCACCCAAAGACAGGAA 

7CA777A7G777CCACCAGCCAGAGTGGAACAACCTTGTAACGCATATGG 

AGTAC7CAAACGAA7ATTTCCTCAATAA7AAGTTCAAATTAACTGAGAC7 

AAAGCC7GCCCGCT7TG7CTGGACATGCC7AACAAAGCTTTGAGGGAAGC 

C7CAAAAGAA7GAAACCG7G7CCAAGTAA7T7AAC7GTGTCC CAGAAAAA 

AA77CAAGAACA7T7AAA7AAA7A77AAAATATGATCAAACCCAGCAAGG 

T7AAA77CAAAA7G7C7GGCA7CCAT7AAAAAAT7ACCAGCC77GAAAAT 
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TGGCGGGAAAATATTA'. -JATAATGAA. .JAAAAAGCAATCAACAGA/ 

AGGCCTAGAAAGTATACATATGATAAAATTAGCAGACATTAAATC&TTAT 

GATTAATTTATTTTATATGTTAAAGAAGGTAGAGAAGAGCATAAGCACAT 

TAAAGAGAGACAGGAAAGTCCCAGTACTCACACAGGGCCAGGAGCAGTTT 

TCACCAGTCAGGTGGGAAAACTTCATATTTCATGGAGCATTGGTAGAGTA 

CACAGTGTCTTGCCTTAGTAGAGGGAXAAATGCTGTTCTGTTCCCGCCTA 

ACCCATCTTGAAAGAAAATCTGAAAGGATCAAACTGTATTCAAGTAACCT 

AATCACATCCCAGCACACAGCTCGACTAGTTATAAAAACACAAAATATTA 

ATATCTAGAAACACAAAAATAATATCTAGCACCCAACAAGGTAAAATTCA 

CAATGTCTAGCATTCAATTGAAATTTTCTAGGCCATCAAAGAAGCAGTAA 

AATATGACCTATAAGGCCGGGCACATTGGCTCATGCCTGTAATCCCAGCA 

CTCTGGGAGGCCAAGGTGGGTGGCTCACCCGGAGGTCAGGAGTTCAAGAC 

CAGCCTGGTCAACATGGTGAGACCTCATCTCTACTAAAAATATAAAAATT 

AGCCCAGCATGGTGGTGGGCGCCTGTAATCCCAGCTACTCAGGAGGTTGA 

GGCAGGAGAATCGCTTGAACCTGGGAGAAGGAGACCGCAGTGAGCCAAGA 

TGGCACCAATGCACTGCAGCCTCATTAGAGAACATCGGGAAG 

>Concig42 

GAAACTAAAGGCTTATT7AAAGCGCGAGACCGTGGCGCCTTTGGACTGGA 

CCCTTTCTAATGATCATTTAGTATCAGGCTATGTGGGAGTTGACCGTTTT 

GCATAGCCTGAAAGCCAACAGTATCACTCCTCCTCTAGGTGTGGCAGAGA 

TGTGAGAGAAGGAGACTGACAGTCTGTGGGTGTGTATGCAGTGTTGGGGG 

AAGCGAGGCACAGGGGACAATACTGTGGTGTAGAAAACTAGTCTAAGGTA 

GCATCAG GAAAT TCATGAAACCAAAATGAATT7CATAACAGCACAAGACA 

TTATTTGTTTTTGCCTCCCTCTCATTTTTTTTTTTTTTTGAAACAGAGTC 

TTGCTCTGTCATCCATGCTCGTGTGCAGTGGTGCAATCTCGGCTCACTGC 

AACCTCCACCTCCAGGGTTCAAGCAATTCTCATGCCTCAGCCTCCTGAGT 

AGCTGATTACAGGTCTGCACCACCCCGCCGGCTAGTTTTTGTATTTTTAG 

TAGAGATGGGGTTTTGTAATGTTGGCCAGGCTGCCCTGtCATTTTTTTTT 

TACTAGTGTCCAGTGGAGTTTTTTAGGGGCTACATAACATGATACTGTCA 

TTAATCTAATGGCTAATGAAAGGGATATGTATATGTTTTTGTGTTTAAAA 

CAAACTTC7TTGGGGTCCTCAATAATTTTTAAGAGTATAAAGGGGTCCTG 

AGATCAAAGAGTTTGAGTTCTGCTGGACTGGGACAGTGGTTGTCAACCCA 

GATTGTACATTAGGGTCATCTGGGAAGCTTTAAAATAGTACTGATGCCCA 

ACCTTACCGCAAACCAATTAAGCCAGAATCTCTGTGGATGAGAAGTCTTC 

ATTGTCATCATCACCATGACCATCATCATTGTCACCGTCACTACACCATT 

ATCATCATCATCATATCATCTTCATTATCATTGTTAGTATCTCCATCACC 

ATCATCAGCATCACCATTATTATCATCATCATCATCCCCACCATCATCCT 

CATCGGAACTTCACCTGCATGGAGGACAATCCACTATGCATTAGGTGCTA 

TGCTATTTGCTATACTCCTTATTCTCACAACTGCCCAGAGAGGCTGATAT 

TATCTCACTTTATAACAGGAGGAATCTGGATCGGAAAAGTTAAGGTAAGC 

TAATTCACAGAGCGAGAAGAGATAGAGCCAGGATTCGAAACCAGTTCTCT 

GCTACATCAATGTTCCCAGTCCTTGCACTATTGAGAACCTCTTTAGTTAT 

GCTTTCACCCCTCCAACACCACAGTAAAT TrriTC ' r TTTTTT A AAAAAAT 

TATACTTTAAGTTATAGGGTATATGTGCATAATGTGCAGGTTTGTTACAT 

ATGTATACATGTGCCATGTTGGTGTGCTGCACTCATTAACTCGTCATTTA 

CATTAGGTATATCTTCTAATGCTATCCCTCGCCGCTCTCCCCACCCCATG 

ACAGGCCCTGGTGTGTGATGTTCCCCACCCTGTGTCCAAGTGTTCTCATT 

GTTCAGTTCCCACCTATGAGTGAGAACATGTGGTGTTTGGTTTTCTGTCC 

TTGTGATAGTTTGCTCAGAATGATGGTTTCCAGCTTCATCCACGTCCCTA 

CAAAGGATATGAACTCATCCTTTTTTATGGCTGCATAGTATTCCATGGTG 

TATGTGTGCCACATTTTCTTAATCCAGTCTATCATTGCTGGACATTTGGG 

7TGGTTCCAAGTCTTTGCTATTGTGAATAGTGCCACAGTGAACATTCATG 

TGCATGTGTCTTTATAGCAGCATGATTTATAATCCTTTGGGTATATACCC 

AGTAATGGGATGGCTGGGTCAAATGGTATTTCTAGTTCTAGATCCTTGAG 

GAATTGCCACACTGTCTACCACAATGGTTGAATTAGTTTATAGCCCCACC 

AACAGTGTAAAAGCATTCCTATTTCTCCACATCCTCTCCAGCACCTGTTG 

TTTCGTGACTTTTTAGTGATTGCCATTCTAACTGGCACCACAGTAAATTT 

TTATAGATTTTATAAGCAAATTGTATTTACTGTGCAAGAATTGGTTTATT 

7TTTAAACCATGTGTTGCAAACATACAATGGTTAATTGTGATATTTGCTC 

AGTACAAGATCATCAGATCACTACACAGACTTGAGGTAATTCCACCTAAA 
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AGCAAAGAGAACTGACCCCACATTAACTGAGAAGTCTTTACTTATTTA'x i* 

CCCTATAAACGAGCCAATATGAAGAGAAGGCCTTAATGTGGT7AACTATG 

TAATTTTTTTCTGACTTTTTGAAATACTGAGAAGAGCTCATGACTCTCCC 

ATCTCCTAATTCTACCTTGGTGGATTTTAGACTGACCACAACTCAT GGGT 

AAATGAGGGAAGACGAATAAGAAACCTTGCTTTTTTTTCCTCCTTGTTTT 

TGGCTGGCTGCAGTGGCTCACACCTGTAATCTCATCACTTTGGGAGGCCA 

AGCTGGGAAGATCACTTGAGCTCAGGATTTCAAAACTGGCCTGGGCAACA 

TAGTGAGAC CCCATCTCTAAAAAAAAAAAAAAAAAAAAAAAAGGCGACGG 

GCGGTGCGTGCCTGTAATCCTACCTACTCAAAAAGCCGAGGTGGAAAGAT 

CACTTGAGCATGGGAGGTCAAAGCTGCAGTGAACCTTGATTGCACCACTT 

CATTCCAGCCTGGGTGACAAAGCAGGACGCTGCCTCAAAAAAACAAAAAC 

AAAACCTTAATTTTTTGGCTATTCTTTTCTGGTAAGAATGGTATAGAGAT 

GGGGATGAGGATGGCTATTGTATGAGAGAGCAAACAGGGTCCAAGCAGTG 

CTCTGGGCTGTCTAAGGACCAGTAGTCAGCTTAACTTCTCAAATTTCCAG 

GGAAGGAGTTCGGAGTGGTAGAATATCCTGGGTATGCCCAAAGCATCACC 

TTGCAAATAGCCTGTCATGAATAATTTGTTTCATTTGTTATGACTGGAAA 

CTGGCTTTGTGTATGCCAGAGAATGGGGGCAGGAAAGAGAGATTGGTGTC 

TTGAGCTCTCTGTGCCTCTGGGGCAGTGATGCTTTTCCTCTCATGTGGAA 

GGAGAGCATGACTGAAAAGGTGCACAAATAAGGTGTCTGTGAGAGAAATT 

AACCTTCGAGATACAGAGACACAACCTTCCCCAAGAGGTCCTCATTGCTC 

TGCCTTTTTTCCTTTTTTTTGCTTGTTCTACCATTAATAACAGAAACTGA 

TTATGACCTCAAAAGAGAGGAGAAAGCGACTCTCCCCACCCTAGAGCTAG 

TTAACCACCATATCTTCCTAGATCTCAGTTCAAGAGTCACTTCCATCCCC 

AATAAAAGCCCTTGAGTGCTGAGCACCTCTCCGTCATAGCATTTGTCCTA 

GGGGTTTTTGTACATTTTCTTGTGTGAAACTTGGGTTGACATCTGTATTT 

CCGACTAGATTACAGTTTCCTCAAGGGTAGGGATGTCTTGCTTGCCATTT 

TCAGTTCCAGCATCTAGACAGTACCTCAAGCAAACAAGGCCGAGGGGGGT 

GCGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGATGAACATGGTGAAA 

CCCCGTCTCTACTAAAAATATAAAAATTAGCCAGGCGTGGTGGCAGGTGC 

CTGTAATTCCAGCTACTCAGGAGTCTGAGGTAGGAGAATCGCTTGAACCC 

GGGAGGTGGAGGTTGCAGTGACCTGAGATCCACTGCACTCCAGCTTGGGT 

GACAGAGCAAGACTTCGTCTCAAAAAAAAAAAAAAAAAAAGAAAGAGAAA 

AGAACATCAAATGAATGAATGAGTGAGATG AATGAG TTAGCAGTGTTGGA 

TTTAAGTGTCAGATTCTTCCCAGCTTGACTTTTTTCTTTGGCTTAGTGAT 

TTTGAGGTCNCAAGATTTATTTTCCTTTCACAAAGGTGATCACTACCATA 

AGATCTTCAGAAAAAGAATGTGGCAAGCCANGTCTCACTAATGCAAATCT 

CTATAACAACTGTATCAGTACT 
>Concia43 

GAGGTGTCATAAATATGGACCGATAGATGAATACAGGTAGGATGGGACAC 

AATCTAAGATCCCAGGGGGGGGAGACCACACGCTTGGTTAGGGAGACCCA 

AAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTAGTGACAGTG 

CAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTGCTATTTCAT 

CATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTGTTGGGAAAA 

GCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTTCCACTCTCT 

CACCTCCACCCAGGCGTCCAGGTGTGGAGQACACTTGCCGGCTGCTTGTC 

TGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACCAGCTGGCCT 

CTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGGTGTGATCTC 

ACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATAT GAATA AAAA 

TGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTATTTTATAAT 

TCCCTTTAGGAGGATCACCTAAGTGAAGACTATTTTTACCTAAGAAATGT 

TAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTAAAATGGCTA 

GCAGACAGAAGTCAGACGACAGGCTAGAAATGT GTGAA GAGTG GTTG CCT 

TTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCATGCTTTCCA 

ATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGACCTCTGATAA 

^CTGCCCCCACCGAGTAAAGACTTAGCTGGGAAAGTCAGCTTCATGTGAG 

GTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTGTCGGGTGTG 

CAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAGAGAGAAACT 

CCAAGATCCAAGAGATCCAGCAAGAAGGCTGGAGTCTGAGGACGCAGAAA 

GCTGAATGGCACAGTTACCACTATTGTGCTGAGGTTCTGTGGCC7CTGGG 

TCTCTTGACAACTGGGCAAAGACCCACAGAAAACTATCTCTAGACCCTAC 
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CTGTGGGAGGGGAAAG1 - JTTCAGATCA a CTACAGGACAGCCACCTGGk-. 

CTCAAATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATCTAGTACATA 

CCAGGTGCTAAGCCTGGGTGCTGGAGACATGACGGGGAACCCATTTACCA 

TGGCTTTGTTACTGTGACATTCACATCTAGGGAAAGCCAGCAAAGGGGAG 

GGATCGAGGAGAGCTTGTTAGGCAGAGAAAATACCCAAGGGCAAGGGAGA 

AGCCAGCCTGTTCTGAGCACACACAGTGGTTCCATCTAACTGGGCCTCAG 

TQ£CAGGT7GGACTGGAGATGGGGCTGAGGAGCTGTCACAGAGCATTCTG 

GACACAGATGTC ACATA GTCCCTTGAGGTTAGGGTCCTTAGGCATGGCAG 

CATTGCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGACAATGTGG 

AAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGTTAGGAGAC 

7AAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGGGGCAGGTG 

CAGGGC CCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAGTAGGTAGAA 

TCAAGGCAGCTGGTAAAGATCTTTTATTACATATAAACTGGAATAAGCCA 

TCTGCTCCAAGACAAAAGAGTAGGCGGAAAACAATACAAGACAGAAATGG 

AATTAGAACAAACCTGGGAGGAATGTGGAATTAGAGTAGAGAGTCCAACA 

CTGGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCTAGGTGTGC 

TTACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGTTATGGCTT 

AAATGTGTCCCCCAAAATGTGATGTGTTGGAAACTTGATCCCCAATGCAA 

CAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATAAGGGTTCT 

GCCCTCATAAATGAATTAATACTGTTATCATGAGAGTAGATTCCTGATAA 

AAGGATGATCTCTGCCTCCTCCCCACAGCCCTCTTGTGCATGCTTTCCTG 

CCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCCTCACCAGA 

TGCAGCTCCTTGATCTTGGACTTTCCAGCCTCCAGAACTGTAAGCCAAAC 

AAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTGTTCTAGAA 

GCACAAAATGGACTAAGATCATTAGATTATCATTTTTT A TCAGACTGTTG 

AAGTGAAAAATAAAAATCAAATAAAGAAATTAAGAGAGCTGCATGCAGCA 

GCTCATGCCTATAATCCCAGGAC7TTGGGAGGCCAAGGCAGGTGGATTGC 

CTGAGCTCAGGAGTTTCAGACCAGCCTGGGCAACACGGTGAAACCCTGTT 

TCTACTAAAATACAAAAAACTAGGCCGGGCGCGGTGGCTCACGTCTGTAA 

TCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCATGAGGTCAGGAGATC 

GAGACCATCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAA 

AAAAAATTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGG 

GAGGCTGAGGCAGGAGAATGGCGTGAAACCCGGGAAGCGGAGCTTGCAGT 

GAGCCGAGATTGCGCCACTGCAGTCCGCAGTCCCGCCTGGGCGACAGAGC 

GAGACTCCGTCTCAAAAAAAAAAAAAAAAACTAGCCAGGCATGGTGGTGT 

GTGCCTATAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGA 

ACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATACCACTGCACTCCA 

ATCCAGCCTGGGTGACAAAGCAAGACTACATTTCAAAAAAAAAAAGAAAG 

AAAAAGAAAAAAAGAAAAGAAAAAGAAATTAAGAGAAGGGCAGGTA7TAA 

CCCCAAATATCCCACCATAGGGACACATTAAAGTTTGCTTGGCCACTCCC 

CTAGCA7AATATATGGAATGTCTTCAAGGACCCTCTGTTGTAAATACAAG 

GCCCTGCTGGACTTAATACAACCTGCAGGCTTTGAGATCCCTACTCTGTT 

GCCATCTCTCATAGGATTTGCAGACCAAATCCAAATACTTAAAATAGCAA 

CACTCACAAACATGCAAATCAGAGCAGAAAAGAAACTTCTAAAAGGCCCT 

GAAACTACACTTTATGAGAGAAGACAATAGGGACCTGAGGGTGGTAGAAT 

TTTCTCTCTATGCATCTATGTTTCCAGGGCTCACTTTCTGVATAAACTCT 

TAAATTGC'i'T 1 TAAAGTAAGGGAACAAGCAAACATTACATTTAAGAGAAA 

TCAATTTCATAAAGAAGGGGGGATGTCCAGGGTACTTTGCTTCCATGTTT 

TGCTTCCATGAATTTGTGTTTAACAGAAGATGCAGAAAAACACACAATTA 

TTGCAAAATCAAGGAAATCCACTCTAAACATCCCTTGGTTTCCCAGGCCA 

GTGTCACAACTGAAAACACATATTGTGGCTAATTATGTGTCACAAATTAG 

AATGACAAGGCAAGAAAAAAAAAACTCTCTGATTAACTAATAGCAGCCAA 

CACAGACAGCCTGTGTAGCTCGACTCTGCTGGTTTATAAAAGGCAGAAGA 

AGCAAACGGCTTCTGTGACCGCAACAGGAAGGGCCTCTGCTCTTAATAAA 

TAAATAACATTTAAATTATTCTCCCCCATTTGCAAAGCATTTTCCAACTC 

ATTATCTCATCTGACCAGGTATTATTGTATCTGACCAAGAACTTGTATAC 

NAAATAAAGAATAAAAAATAAATATGGGCCANGCACAGTGGCTCATGCTT 

GTAATCCCANCACTTTGGGAGGCCCAGGCGGGTGGATCACTTGAGGTCAG 

TAGTTTGAGACCAGTCTGGCCGACATGGCGAAACCCCGTCTCTACTAAAA 

ATACAAAAATTAGCCCGGCATGGTGGCACATGCCTGTAATCCCAACTACT 
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TGGGAGGCTGAGGCACGa>3AATTGCTTGAACTCGAGAGGCGGAGGTTGCA 

GTGAGCCGAGACTGCGGCCATTGCCCTCCAGCCTGGGCGATGAGAGCGAA 

ACTTCATCGAAAAAACAAAAACAAAACAAAACAAAAAACACCTTAGAAGA 

AGCGTTCCTCCTCTTGCTTTCTGAAGACACTCTACGCTGAAACAGTAACT 

TTCAATAAACCATCTCTTCTCACCGCACTCTGCGACTTGCCTTGAATTCC 

TT TGT GTGCAAGATCCAATAAGCCTCTCTTGCGGTCTGGATGAGAACC CT 

TTTTTTGGAATACTCTGACACAACAAATTGCAGAAAGAAAGTCTCACATG 

TATAAAATAAGCAAAAAGATTCTCTGGCATCTGAAGAAACAATTTCCTTG 

TCAATATTAGTATCACTATAAGTGTAGAACAACCTGTTGTATGATGCTAC 

ATAAAGTATATGAATCTGAATACTGTTGGATACAAAGGGAGACTATNNAA 

TGTAATACGTCGCCCGAAATGACTACACTGTTGGTGATCTTTCTTTCAAG 

AAGCANAATATTGCCTCNAACATCCTGTACATGGTATAAAATTTTA 

5Concig44 

CCCAGCAAGAACACCAATACAACGGGGGGGGCGTTCTTTGTGAGGGGTGG 

GGAGGTCAATTTTTTGGAACCTGCAGCAGGTAACACACAAAACTTCCACA 

GCTGCTACCAGCTTTCCAGGAGAGCCTGTGTACCTGGAGAGGRGAAGGCA 

AGTGCTTCCGAACTTGACTTGATGTCTTAGATTCTGCAATGCGTAGTCTG 

TAGGGACAGGCTGTAGCTTATCCTATAGGCTTGGGCTGGAGTCAGCAAGC 

ATCTGGGCTGGCAGAAGATAAAAGATGCAAAGGTGGAGGAAAGCATACGT 

GGTCTGGAAGACAGACTTGGTGGGTGGGTGGCTGCTACAACACCCTAGTT 

AGAGGTAGAGGGGTAAGTCAGTGTGTCTTCTGCACAGGCCTCTTCCCCAC 

CTCATTCTTCATTTCCCATACAGCCTTGCTGAGTTATTCACAAACATCTG 

ATTCAAC7GGAAGCTGGGTTGAGGATGACCTAAAGGACTAGTGTGATGCC 

TGCCCAGGGGTGTGGGCCCATAGTCAGAGTCCAGAGCCTCCTCTCAGCTT 

TTAGCACATCTCACCCACATCCTGGGTCCTTAATTAGCAATATGAAAGCA 

AGCCAAGTGACAAGATTTTGTCCCTGGGAAGTCCAGAAGCACTCCTTTTC 

TCATTTGTATAAGCATAATGATTTGCTTACATAAATAATCATGAAAATTC 

AAATCCCTCTCAGAAATCAGGTCATAAAACCATGAAATGCAGCATGTGGG 

CAAGAATCAGAGGGAAAGGTAGGTCTTGGAAAAGAAAGGATGGCAGGGAG 

GAAGAAAGCAGGGTGCCAGGGGCCCTGGGCTGCTGTCCAAGTCAGGTGGC 

TCACCGTCTCTGAGAACATTTCACTTTCTGGTAAATGGGGCAGTTGGAGA 

TAGAAGGGTTGGGTGAATGCCAAGAGTGAGCACAGCTGAGGTCAGTGCTG 

TGCCTGCAGTCCAGGCGGGAGTAGAAATCCTGGGCCCATCTTACCTCCGA 

CCTCATTTCCTCCTCTGTAATAATGTGGGGGTGGGGGAAAGTTCTGGTCA 

TCAGCCCTAGCATTCCATGGTTCATTTCCTCATCAGTGATGGAAAATCAC 

CAAGCAAGAGAACAGGATGGAGAATAACCGGATGGGTGCAATCGGAGGTG 

CTATTTCAGGTGAGGTGGCCAGGGAAGGCCCTCTGAAAGGGTGGCTTGAG 

CAGGTGGCTGAATGTACAGAAGCTGCCAATCATGAAAGATCTGGGGTACA 

GCATGCCAAGCAGAGGAAATGCGAGTGCAAAGGCCCCGAGATTGGATGTG 

GGCTTAGCACAAATGTGGCATGGCAAGAAGGCCAGTGTGGCTGAAGCAGC 

ATGAACAATGGGTGGAGGGGCTGAGAGGACAGAGGAGCAGGAAAGAGCCA 

GGCTTGGGTAGGAGAGGTGTCAACTTGATATATGATGCAAAGCCCTTGGA 

GGTTCCCAAACACAAAAGCAATGATCTAATATATGGTTTTAAAAATGCCA 

CTCTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAG 

GCCGAGGCGGGTGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAA 

CZAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGCG 

GTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAAT 

GGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTG 

CAGTCCGCAGTCCGGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAA 

AAAAAAAAAAAAAAATGCCACTCTTGCTGTGAAAAATTGACCCTGGGGGA 

AGGAGGAGTAGAAATGTCAAAAGTGGAAGCAGACCACTCAGGAGGTCAGG 

GCAATGGACTGTGCAGGAGAGACTGACATCTTAGACTCGGGCAATAGGAG 

AGAAGGTGGTGAGGATTATATTCTGGGCATAAAGGCAACAGAACTAGCTG 

ATGGCGTCAACGTAGGAGATGAGGGAAAGAAAGAAATCAAAGGGCATTCA 

TAGGT7TGAGGGTTGAGTAACTGGGGATATTTAACAGAAATGGAGAAGTC 

TGGGGAAGGGGCAAGTATTGTGGGGGCAGGGGTCAAAAGTTCTGTATTTT 

GGCCAAGTTAATTAATATTTGAGATACCTCTTAGGTGTCCAAGTGAAGAT 

GTCAAACAGTCAATTGAATACAAAATCTGAATCTTAGCCCAGGATGGTCT 

CACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGAGGATCACTTG 

AGGCCAGGAGTTTGTGATCAGCCTGGGCAATAGAGCAAGACCCTGTCTCC 
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ACACACACACACACACA. AAAAAGTCA'i wCAGGCATGGTGGCACATGC- * 

GTAGTCCCAGCTACTCAGGAAGCTGAGGCAGGAGGATCACTTGAGCCCAT 

GGTTCAAGGCTGCAGTGAGCTATAATCACATCACTCAATACTACACTCCA 

GCCTGGATGACAGAGAGAGACCTCATTTATTAAAATAAAATTTAAAAAAA 

TTAATTAAAAATAAATCCAAATCTTTCCTGAGATTCATATTCAGGAGTAA 

C TGTCATGTAGAAGGCATATAATGCCATGGGTCACATGAT AC CATCTAAT 

GAATGCCACTGGAAAAGAGAGAATAGCTAAAAACTGAGCACTGGGCACAC 

CAGCACAGTGAGGTTGGAAGGAAGAAATGGAGCTAACAAAGGAGACAAAA 

GAGGAGTAGCCAGTGAGAAGAGAGAAACATCTGGAGAGAAGAGAGAGCAG 

CAAAAGGTGGGTGAAGGAGAATGTGGTCCACCAGGCCCAACAATGCTGAG 

CAGTTGAGTAAGTGAGGACCTGGCCACTGAATTTGGCAAGAAAGAGGATG 

TCAGCGGCCCTAGAACAAAAGTGAAGAAGAGCTTGAGGACGGAAGCCTGA 

CAGGAGTGAACTGAGGAGAGAATGAAAGGTGGAGACATGGAGCCAAGGAG 

CACTGAGACTCCCTTGAGTAGTTTTGCTGTAAAATAAAAGTGAGTGCAGA 

GACGGGGCAGGGGGACAGAGAAATGCAGGGGTAGCTGGAGGGAGCCACAG 

AATCAAAAGAGGGTTTTTGTGTTTAAGATGGTAGTTGTCACJWTAGCACAT 

TAGTAAGTTCATGTGAATCACAACGTAGGTGAGACAGATCACTAATGCAG 

GAGTCAAATCCTTGCAGAGCCCCCAGAGGAGGTGATGAAGGGAAGTGATG 

GACAT.CATTCAGATGCAAGTAGGTTAGCAATTCCTGGGGTACAAATAGGA 

GGTGACTCCTTTCTGATTGCTCCTGTTTTCTGAATGAGATAGCACATAAA 

GTCCACTCAGCCATGTTAGCTGTTGAAGTCCTTGTGGCTGTCATGCCTGT 

ACAGACTGGGCTCTCCTCTCCAGCATTTCCTCTCAGACTAAGCTGAGCTG 

CACTAGCCGCTGCCACATCCTCTTGGGGCCATCCTCTGCCACACTCCACA 

TATTGCTGTGGTTTGCTTGCAACCCCTGGAAGGTCCTACTGGCTGCTCCT 

AGAAGAGTCTGGGCGGCATCTCTCCCTTACTCGTTATCACATGGTGCTGT 

AAGCAGTGGCCACACACTTTAGCTGGTGGGATGGGCCATCACAGGCAGTA 

AATGCGAAAGACTGCTCAGATTTTAAAGCACCCATGAATCAGTAGAATGA 

GTTTAGAATTGTAGTCATCAACACACATTAAAAAAAAAAAAAACAGGCAC 

TAAAAAAATTAGTTGAGTAGGATAAAGCCATAAAAGATATTAACTACAAC 

CCAGATAGGAGGTGCAAAATTGTCCTTACATAAATCAGATGGAAAAAGTT 

GAAAGCAGATAAGATAAAATAGGTAAGCATGACATTTAAAAGGTATTCAT 

GGGACGTGGTTACAAAACCAACTCACAACTAAAAAGTCTTAGGACCTCTC 

GCTGACTTAGGAGCCTGATCCCAACTTTGAGAATGACTCAGTGTGTTACC 

CTGTGGCTAGTGTAGACCAATGATCCTGTCTCAGAGTCACTAGCCAACAG 

CCCATATCAAGTAATTGAAACTTTGACTCAGAAACCTCAGTGTCAGAACC 

TTTGACTTAGGAACCACCTGTAGTGGTTAACTGCAATTTGCACCCCTTAG 

TTCAGGGCTTTACAACACCGGGGGCGGGGAGGGGGAAGGCATAGAGCTGA 

TGACCTAAAGGAAACCCATTGCAGCAACGCTTTTGTGTTAAGTTTACAAA 

TAAGTGT7GTTTTAGAATCCTCCAGGTAATGCCTTTGTTATTTAATGTG7 

CTGAGACAATTCTGCACATTAAAGAATATAAAA TATTA CCTTGTAATTCC 

AATTTGAAATGTGTAATTGACATTAGACTTCTATTTTAATTTGAAATGTC 

TAAAACAATGT GGTT^G T TTGTAAAAGGTGTGT GAATrTTGAGTCTGAT 

TTACTACATTTTTTTTTAATTTTCTTTTTTTTTGGAG 

7TAGATGGCTAGAAAGATCGCTAGGCACATGTCC 

>Concig45 _ 

GATGTGTGTACGTGTGTGCAAATACCGTGCCTTTTTTGTTTTCTTTTGTT 
GAAACAGAGTCTCACTCTGTCGCCCAGGCTAGAATGTAGTGGCGTGATGT 
CAGCTCACTGCAACCTCCGCCTCCCAGGTTCCAGTGATTCTCCCGCCTCA 
GCCTCCCAAGTAACTGGGATTAGAGGCGCCGACCAACACGCCCAGCTAAT 
TTTTGTA T ' 1 ' 1 1 IA GTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 
TCTAACTCCTGACCTCGAGATCCACCCACCTCGACCTCCCAAAGTGCTGG 
GATTACAGGCATGAGCCACCATGCCTGGCCAATACTGTGCCATTTTATTA 
TCAGGGACTTGAGCATCCATGGATTTTGGCATCCATAGGGGTCCTGTAAC 
CAATACTGGACAAATACCAAGGGACAACTGTATTCTAAAAAGACCAAAAA 
TTAATAAGCAGGACGCTGAAGGTAATTGCCCCAATAAAGTCATGATCCCT 
TGCCCAGTGTCTGAACCTCAGCCAGTTTTCATACTCAGGACCTATTGGCT 
GCAGAGGTGGTAGGAACCATATGAGAATCCTGCAATATCATGGCAAGTAT 
GCACTTTAATGATATCTGCAGTCCTTCCCCAAAAGGACCTTACATTTACC 
ATACTGCTATGTCCTGCGTGAGAGGGTAATACTCAGATTTTTTTTTTTTT 
TTTTTTTACACAACGTCTTACTGTGTTGCCCACACTGGAGTGCACTGGCT 
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Z GAT C77AGC7CACTG C .C77C7G771 _ JTGGGCTCAAGTGATTCTC: . 

GCCTCAGTTTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCTG 

GCTAATTTTTGTATTTTTAGTAGAGACGGAGTTTTGCCATGTTGGCCAGG 

CTGGTCTTGAACTCCTGACCTCATGTGATCCGC7GGCCTCCCAAAGTGCT 

GAGATTCCAGCGTGCGCGGCCATACCCGGCCGGGAATTCTTTATATATTC 

TGAAAAC7AATCCTTTGTGAGACATAAG7GT7G7AAA7ATTGTATC CCAG 

TTTGTGGCATGTATTTTTAATTTTTAATGGTGTCTCTCAATGAAAAAAGC 

T7AACAC77AAA7GAGGTCAAAT7GA7CACC7TT7 T AT77ATGGTTGATT 

CCTTTGGTGTCATGTGTAAGGAATGTTGTTCCTTCCTGTCCCAAAGTTGC 

AAAGATT7C77G7G7A7TTTG7CCTAAAAGTTT7AAAGT7T7GCTT7TCC 

CATCTGTGCACATTTCACATTTGCTACATCTCACTGACTGCTTCCTCTGC 

TGCAGAGCAAGCTCCATGAGAGCAGGAGGCATGGGTCC7GC7TCTTGTTG 

GTCCCCAGAGCCCTATGTCATGACTAGGACCTGGCAGGGGACTAGTGAGT 

AGCTCCTGACTAACTGACTCAATGAATGAATGATTGGATGATTGAACAAA 

GTGGTATGGGAGTTCACAGCGAGTAAGAGATGCCTTAGAAGAGATGAAGA 

AGGAGATGGTATAGGGTAGTGGTTCTCAATTCTGGGTCCATG5TGGACTC 

ACCTGGGGACCCTTAAAATGTACCGTGGAGGATCCCAGCCCAAGAGATTC 

TGTATGACTGGTCTAAGATGTGGTCTGGGCACCAGGTGATCCCAGTGTGC 

AGCCAGGCCTGAGGCCACTGGATTTGGTGGTAAATGAGGTAACTATCAAG 

GGTACAGACGTTGGTTGCCAACAGGCTTGGGCTTGAATTTAAGCTTTGTC 

ACTGACTTGCTGTGTCCTCCTGCACTCGTTGAGCCTGTTTTCTCAGCTGA 

GAGATGGGTGTGATAACACCTACCTGCTGTAGTTGTTGTGAGAGTTAGAG 

GAGATAAGCATGTTCCTGGAATGAAGTGTGTTCTTAATCCATCATAGGTT 

TTTTGCTTG7TTGTTTGTTTGTTTGTTTGTTTTTTCCTTTTCAAGAATGA 

GGTTGAGCCAGACTTTGACAGCTGGGTGGGAAGTGAACATGTGGTGATTG 

GGAGAGAAGGGCAGTTTATGTGAAGGGAATGTAATAATTAGAGAGTGGGC 

GTGGGAAGACATGCTGGGGAGAGTGAGCAGGCCGGTTAGCCCTGGTAGAG 

GGTGCAAGAGAGCAGTGCGGAATCTGCCAGGGAGACAGGTGGGTGACCAG 

GGTGCCAAGGGTGTGGCTrTTCCCAGGTTCCCATGGACACAGCCATCCTC 

CCAGATGCCCAGCCTAGCTGTGAGTGAGCAAGAGTTCTGGATTGTCTCTC 

TCACTCTGTCTTTTTCTCTCATTCCAGAAACAAAGCAGTGACTGGTACTT 

AGGAGGAGAATCAGGTCAAGTTGGGAGAAACTTGCTTCTGCTCAGGGGAG 

CAGAAGCAAGAATGGAGGCCCCACCCATGCTGGAAGATGATGAGGGTTTT 

GGTTCAGGGAGGAGGAATATTGGGGATCTAAAGGGGCCTGGGAGTGGGGC 

AGGACCCTGCCTTAGGACAGGTAGAAACATTTTCTATAAAAAATGGGGTG 

GAGGTTGATGGTAGGACCAGGCATCTTTAGTTGGCTCCCTGGAGTGTCAA 

GCCC77GAGA7GGTCTTTAAAAGCCATGCAG7GGGG7TTGAATCTGGTGT 

7CAAGC7CATAGGTTATTAACATAATGACACTTGGAAACTAT7TGGGAGA 

3CTCAAGTGAGTGGCCTGGAAGTTCTGTGTTGGTGCAGGAGGTGACTTAG 

GATGTGCTGCTCCAGACTCATATCTTTGACTGCACACCTGATGCTTCATC 

TGGCTATCC7GTAAGCACCTTCAACTTAACATGTCCTACACAGAACTCTT 

GATATTCC7GTTCCTCCCCCAGTTCCTCAGTTCTTACCAAATGTTCTTCC 

AGTTACCCAATTGCTCAAGTAAAAAATCTAAGTCCTTCTCTTGGATTTCT 

GCCTGTTCCCTCAACATCCCACCTATCCATGAGTGTTCTGTGGGCCCTGC 

CTC7GAAATAAATCCTGCCTTTGTCTCCCAGTTCAC7CCAGCCACCCATC 

CTGGGGC7GCACCCTCCTCCTTCCAAGCCCTC7CCC7TTCCTTCCTGGTG 

CTGCC7G7CATGTCAAGCATATGCATCAGTGCGACCAGGACATTTGAAAT 

GCAACCAGTACAATTGGGCGCGGTTATGCCTACCAG7TTTTCTTCCTTAA 

ACA7777ATA77TA7G7T7GAAAGCA7GCCACCTTTC77CAC77GCCAAC 

T7GACAGATTTATTAGTTGACAACATCCGCTGATAGCATCAGTAATAAGT 

TAAT7G7TTTTGCACATGTAGCTTTAATTATTCTCA7TATCATTTATAGG 

AGTTATTCTTTGTAAAGGGTAACTGAGTTTTCCAAAACAAACAGAAATTT 

GGGGTGGGCCCATGGAGCGTGACTCATGAAATCAGATTCTTAGAAGGACC 

TCGGCAAG7CTCTGGGTTGCTGTTAATGAGCCTGGCTGGCTGCCAGGGGT 

G7GTCTGCCC7TTATGAGGCCACCACTGTTCAAATGCTTGCCTGCAGCAT 

7ACTTGCC7AGGTAGTGCTTGTTTC7ACTGAAC7GTCAGGGATCCAATTC 

77TG7GG7CTAAGTAACAATACTCAGATTCACAAGGAATTGATTAATAAG 

CCAGAA7GCCAATGTATTACATTT7TGATGAAGACCATATT7ACAG7GAT 

7G7ATC7GC7CAAGCTCAAATTAGGATTAGAGT7C7GACAAATACATA7G 

7GAGAAG7ATGAGGTTAAA7ACT7GAAATTTGGACTT77C7AGAAAATC7 
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GAATGTGATTGCCATTCACATACCTTTCTGGGGATGATGATTCTTGTACT 
TTTATTTTAAAAGACATAGAAAACTAACTTAAGAATCAGATTGCTTGGCT 
GGGCACAGTGGCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCAAGGTG 
AGTGGATTGCTTGAGCTCAGGAGTTTGAGATCAGCCTGGGCAACATGGTG 
AAATC C CATCTCTACCAAAAATACAAAAAAAAAAAAAAAAACAAC CAAAA 
AGAATAAATTAGCTAGGTGTGATGGTGCGTGCTTGTAGTTCCAGCTACTT 
GGOAGGATGAGGTGGAAGAATTGCTTGAGCCCAGGAGGTGGAGGTTTCAG 
TGAGCTGGGGTTGCAACAGTGTACTCCAGCCTGGGCGATAGAGTGAGACT 
CCGTCTCAAAAAAAAAAAAATCAGATTGCTTTATTGCTGGTTTTCTTTCT 
AAAAC7GAGATTGGGTCCCATCATCCCCTGGCCCCCATTGGTTAATGGTT 
CCTCCTTTGTCTATTGAATAAAATACAGATGTCTGCTTTTGGCAAGVTGG 
TTGAATGTAGACACTGCAGGGTCTTCCTGACTCAAAATGATTTAGGCTTA 
GATAAAACACATTTGGAAATGCATTTCTGGATTAACACCAAGGAAAGGAG 
ATCTCTTTAAATCCCCTTTCTGTTCCCCCCTCCCTACCCCCTCCAATTGG 
GCTTAAGTAAGAAGGGTGGTTACCCGCTAGTAAACCCCCTTCGAAGGGGG 
TCTTCTCCTCTAAGGGAAAACCCTTGTTTTGACATTTGCTTCKATGGGCC 
CTTGTATTTTGTTCCTTGCTAAACGGGTGCTAAACCAGGGGCCTCCTCTT 

>Concig46 

AAGGCTTTTAGAATATTTGCACACTTTAGAAATGGAAATGTTTTTGGGGG 

GCGAGTTGTCTTAATATTTCATTTTTCTAGCTTGTGTGACATCCTTTTGA 

AAGCAGCAATTCTGGCCTTTGTGAGAGATGGTGAATGCCTGCAGGTGTGT 

GGACCAG7GCGTCCCTTCCTTCCTACATGCACGGCCCCCAGCTGGGCCCA 

GCAGAGTGCTGTTACAGAATAATTTCCAAGGGCTGTGTCTCTAACCTTTG 

GTCTTGTCCCCCATTGCTGTAGATTTGGCCAATTGACTTCATAAGTGCCT 

CTTATGAACATAGATGTTGGCAATGGAAGTTGAGGACCAGTCAGTGGTTG 

TTTTATTGAACACACAGCGTAAATCCCAACACAATGCTGACCTAAGAGAA 

TTCCAGCCACTCTGATTCTCAGTCTCTTTATATCTGAAAGGGTTCTGTTC 

CACTTTTTCCCAGATCAAAATGTCCCTGCAGCTACTCAGCAGAGCTGTCG 

CAACTTATACGTAGAAGAGGTAACAGTCCACAAACAGAAAGGCACAGGAC 

GAGAGTGGTCTGGGTGATGCTTCCTGTGGGGGAAAAGGTGATGAGGGTGC 

ATCTGCACACCTATGTTCATAGGTAAGTCTGGGAGGAGGTGACCTCCCCT 

TTGGTTGAGGTGCTGAGGCGTCTTGTTAGAATGGCACTATTCCATTTATC 

TGATGCAGTCTGTGGGAATTTTGTGGTATGGCCACCACAGGTACCATGCT 

GGGAACAATGCCAGATACTGCCTGCTAAGCCACAGCATGAGTCACATGAG 

CATTTGTGGGCTTTGGGAACTAAAGTTATTGAACGATAGTTATCTGAAAA 

GGAATTTAGGGAAAGGGGACTTTAGTCCAGCGAACAGTTTGCAAACCAGG 

GGGAAGGCAGCCTTCAGCGTAAAATGAAGACGTGTGTGCCCCAAATAACA 

AAGGGAGAGTTTGTCTTTTAGAGAGTAAATGTCCACGCAAGGTTCCACTT 

AGGCAAATGAAAGATGCAAACTTGCTTAGTTCTGATTTGTTTACATTTGC 

TGAATTCGGATTGGTCCGTGCAGGCTTTTCTGGGAACTCCAAATACATGT 

ATGACCTCTAGTCATACATGGCAAATGGCCGCTTGGCTCTAATTTGAATT 

TAGGCCCAGTTAGTCACTCAGGATTAACCTTTTTCAGGGTTCACAGCTCT 

GAACAATGG ACTTA GACCTGCAGGACATAATCTGTTCCTAACTCTGGGAC 

TACCTGTGCCTTTTGACTGTGCCCAGTGAGCAGCTGTGGCTCTGGGCCCA 

GACCCACAGGGCGATAAGGCACAGAGGTACGGATGGAGCAGGCTGTCCTT 

GCTGAGTGATCATGAAGATACACTTACATAGAGCAGCACTTTTCCTTCCA 

GTCTTTGTGATTTAACTCATTAGATCCTTATAACAAGAGTCAGTCCTCTA 

TTTAACCCATGAAGCACAGGTGGAGTCCAAGCTTAGTTTGTGAAGGATGA 

GCCAAAAGGATTCTTCTCTTGTAGACCTCAAGCTCAGCTCTCTCCATGGG 

CCCTGGAGTAGGTGAGAAGGCCTCTGTCTTCCAGAGCCCACTGCCAATCA 

TCTACATTTTCTGTTAGCCCAATTCTAGGACATTGCTTTACCAACTGAAG 

GGTGAGAACTATCATAAGTTATAAAAATCAATTGAAAAACAAAAAGGTAC 

AGAACAGAAAATAAAAGATGAGAATCTATTAAACATAGTGATGTTACTGG 

AAAAGGGGGTCTCAAACCAGACCCCAAGAGAGAGTCCTTGGATTTCACAC 

AGGAAAGAACTCAAGGTGAGTTGCAGGGTGCGGTGAATTGAGAGAGTTTA 

TTGAAAGCTATTCCATTACAAAGTAGAGCATCCTCAGACAGCAAGTGGAG 

GAACATGCCATCATTAAATTTTTCTTATATAGGAATCTTGTCTATATAAA 

GACTAAACTAAGCTGTGGCTATGTGTGGGTGGGCCGACAGCATGAAAACA 

T7TATTCT CC7ATTGATTTAAAGAGAACTATCCTTGACATTTTAGTGTGT 
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TTAAGTACATCAAAGCAl^CTATAATTA'rCTTGAAAGCATATATtTTTA 

7AGGGATTGGGACATCTGGGCTT7CTGTTGTTGTAGAAGTTTGTCC7TGC 

AGGGATTACCAAGCCACTTCCTTAGCTGTAAACATCTTAGGGCCATGGG7 

CCTGAC7GGCAAGGAATGTG7CT7GC7AG77TTAAGA7GGGC77GAT77G 

AAAATGG7G7CCATCTGGCTCTCCTAGGC7CCTGC7TTCC7AACAGTAAG 

GGTAAATGCTATGTTATGAAATGTCATTTCTGCCTTTAGCTTGCAAACTC 

77GA7GG7GAAA77C7CCTGTCCGT77TCAGTGGGG7ATTTAT7CTGCA7 

CCACG7C77CACAAGGAGCTGAAAACAAAT7GGA7GGAAGCAACTGGG77 

rTATGGGACACGTTAATGTTTTAATGTCATTTGGTGTGGAATTCAGATGT 

CCAAGCAACATTTTACACTACAAATCTGCAACTTTAATAATCACTCAAAG 

7ACC7GAACC7CAATGCTTTCAGACAGACTTGGTATAAAGCCACCACC7C 

TT7C7AT7A7GGCAGCCCTATCCTGAGGACACAAATTTCTGCAGGGC77C 

TGGCATATCTCTGATTAAACAAATGTCAACAAGGTTAAAACAAATGTCAT 

CTCTGATTTGTTTGTTTTAAAGCCTGGATTTACTCATTGAATATTTCACT 

CCTACTAGCATGTCTTGTAGTAGTTTTCTTCAGGGACCCTAATTATTGCT 

ATTAAAAATATGTGTGCAGCTACATGTTTTTTTTTTATCAATTTGCAATG 

AAAACTTTAATTGAATAATCTATTAGTGTTATTATTTGAAAGTGAAATCT 

TTTCCTTTTGCrTTCTTGTTCTCACACATAGTGCAGACAGTTTCCACACG 

GGCTCATAAAAGGAATGATTCTGCCTTGTGTGAACTTTTTGCCTTTATTG 

TTAATTGCACCATTTTGTGACTGGCTTCTTGACCCTGTTGTAACCAAGCT 

CATAATG7ACATTATTTCTTATTTTGCAGTTGTAGACACTTGAGGXAGTT 

CCCATTCTTTGTTTCTTCTTGCTTTTGTTCCCTGTGATAACTTTTTCATG 

CAGACA7777T7TTT7T7TTTTTTT7GAGACCGAG7C77GC7C7GTCATC 

CAGGCTGGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCTGCCTCC 

CAGGTTCAAGAGATTCTCCTGCTTCAGCCTTTCTAGTAGCTAGGATTGCA 

GGCGTGCACTACCACACCCAGCTAAATTTTTCAAATTAGCCACCCCACCT 

GGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCAACCATGTTGGCCA 

GGCTGGTCTCGACCAGGTGATCCACCCGCCTTAGCCTCGCATAGTTGCAG 

GTGCTATTCTGAGCTCAGGGCTCTGGCAGCTACAAGCCCAAGATGCGGTC 

TCCAACATGTGGCCATTCAATGTCATGGCGCCCTCTACTGGTCCTGGGAA 

GCGCAGCTCTGCCAGTAGCTCCAGCAGGGCACAGCTGTTAAGTCGTGATG 

TTCTACAGGTGACCAAAGGGCAATCTCTGGACTCCTTAGCCGCTAGGTCC 

TCTCTGTAGCAGGACCCAGGAGAAGGCAGGGGCTGAGGATGGCTCTCTTA 

GACATTTGTGATGAACCAAACGTGTGCATTCATGAAACTTCTGTGAGCAA 

GCAGGTGAGTAGAGTTGGGTTATAAAAAGTCTTAGGGTCTCACTACAGAG 

ATGGACTTGCTGTGTAGATGGTGCAGAGCCGCTGAAGAGTTCTACTTGGG 

G7AA7GG7G7GATTGGGTT7GCGTTTTAGGAAGATTTCTTGGCCAGAATG 

AGGCGGGCAACCCAGAGCAGGGAGTGGCCACATGTGGGTGTGCAGTTATG 

GGCCACTAATCCAGGTGATAAATGGTGTCTCTGAACTTCAGGTGGGGGTG 

CCACATGTCTCCATCTGCTCTGTACCCTTGAGACTGGCCTTATGGGCTGC 

CTTAGTGGTCTGTTGTCCTCTATCTCCTGGTTGGGCTCAGGCAATGGGAG 

AT CAGAGGG AGGAAAGAGAGCTTGGTTAGAGTGCACCCGCGC C C CTTCAG 

GTTGGCAGTGGCCACATTCCCCTATACAGAAGGCCACAGTTTCTGTCAGT 

GGCCCTCCCACAGCCCCAGCTTTCTCAGTGGGCCAGCCACCTCCCCATCC 

C77GC7CCTCC7CCTCCAGAGAGGGT7G7GGATTTCCACTG7CAGCAGTG 

CCTGGAGCTCCACCATCTCCTGCTGCTTCCTCTGGACCTGCCTGCAGTTT 

TATAAATAACCTTTCCTTACATTACCTCTAGCATGCACCTTTTGT GTGTA 

TACTCTGCCCCCTGTCAGCACATGACTCATGCCAAAGAGTTTGAATTTTT 

^TCTCCAGGCAACGGGAGGTCATTGGAGGATTTTAGACATTGAGAACAGA 

TGTGTATTGTGGAAATATCTGTCTGACTGAAGTGACCAGGATGGTCCAAA 

AGAGCGAGAATTTGAGGCAAGCAAACCATCAGCAGGCCAGCAGCAGAAAT 

CCAGGTCATAAACAGGGAAGCTGAGGCTCACAGGGTTGGATCAGGGAATG 

GGAGAGGGAAGCCAAACAATTCCATGAGCATGTCAGTTGCACATATGACT 

TGGTAACTATTTTTATTTTTATTTTTATGTTTTGAGACAGAGTCTCGCTC 

TGTCACACAGGCCAGAGTGTAGTGGCATGATCACAGCTCTCTGCAACCTC 

TGCCTCCTAGGTTCAAACAATTCTCCTGCCTCAACCTTCCA GGTA GCTGG 

GAC7ACAGG7GCGCACCACTACACCCAACTAAGTTG7GTATTT7TAG7AG 

AGATGAGCATTCACGCTGTTGCCTTAGACACGG 
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AATATTGATTATTTGACCAGAAATTCATGCAGCTAACCGTGACCCCTGGC 
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JLAAA7AAAA7AG7G7A7,. - G7ACG7GCATA7ACA7GCAAAGAAATGAGV . 

GAAACTAGAAGGATGTCAATCAAATGATAACATGGTCATCTTGGGGTCGG 

AG7ACA777GGGGA7GAGGGGAGC7G7AAAAGCAGAC77GGACC7777C7 

TCTACCAGTACCGTGTCATTTGAATTTTGGAAAGAAAAAAAAAAACTCAG 

AAGGAGGAGAAGGAGCAGGAGGAGAAGAAGATGGATCTTAAGTGATTTGC 

CCGGGAGCACCTTGAGAAGGTGAGATTCAAGTCTAGGTCTAAGCTTTCTA 

A7TCCA7GAG7GGGAGTGAGCCACG7CCAAGAGGAAGC7CAAAAGGAAGA 

TGTTCTCCATCATCTCTTGCTCATCCTAACAGCATGCAAAACCACATCCA 

ATGCAGCTCAGAAAACTCCCAAAT7GCCAAATTTCATTGGAAACACTTAA 

TGCTGTGGTTTCCAATTTCAACTGTAAAGTAGGTATGTATGCCATTGTTA 

CCATTAACTTCTCAGAAATGGAGAGAGCTCTCTTTCCGCCTCCTCCCCCT 

C7GC7G7GGC777GGTGAGACGTGCAC7CAGGC7CACC7G7C7CCA7GA7 

CTCCAGTAAGTACACATGAGCAGAGAGGCCTCAGCTCAGCTCTTCCTGGT 

CCCACCAGGGTTGATTCTTTGAGAATTCTAGAATGCCACATCCTAGGCCC 

CCCAAAGAAATCCTGCATCTTACCCCCAGAAATATGAATCATAGCAAATT 

T CAAAT C AA C CATCGTTTAAT ACT CACAGACTGGGCACAT C C^AAAACAT 

ATTTTCAGTTTTACAACAGTGCCTGGTGCATATCGGCACTATTTGTGGAA 

GCAATAAA7CG ACACGGAGC TGAAACACAAACAAA7GCGA AATT G TT7T T 

ATAACACCTGAl'TTTCUTrCUXSTTTCTTTATGCAGTTTAGTT^GlTl-IG 

CTTAACTC7ACCTCAGACCATAGTCTGGTAAACTCACCACCCAGAAGCTC 

CC77GAAA7G7GGGTA7GCAGCCAC7AGG7GGCAGGAGAGAG777CC7GC 

C7GGAGGGAGGACAGCCAC7C7G7CCCCGGG7CAGGCCAGGGCCACCC7G 

C7ACC7GCAAAA77AGCA7GGGGC7T7A7GAACCACAGC77CC7AA7AAA 

CACAGGA7C7GTTTGA7AGAGAC7CCAAAACACGCC7ACC7AG7GA7GAA 

AGAC7CAAC77CAGAAGAAAACCrrCA7GGCAAACA7C77CAGAGA7G77 

TCCAAC7TAAGG77C7GAACACAGACGC77CCCCAGAAAGCCA77G777C 

7CAGCACC7GGGAGCC7TGCTTTGC777GC77ACAGACTCGC7GT7C77A 

AA7G^CTGCCAAGATAAaVTCTG7C7C77C7CTTACCCTC7A77TCGA7A 

7AAGGAC7CC7CACTCTTGTTGC77CC7AT7GGC7ACCTC7CCACAGGGA 

GAAA7CGC7GA77TAACAGCAG7CAA7ATCCCAAA7CTGGAACAGGGAAC 

AGGGAAGCA7TTAAAAA7TGGAGAATT7AGGCCGGGCACAGTGGCTCATG 

CCTG7AA7C7CAGCACTTTGGGAGG7CGACG7GGA7GGA7CAC77AGGAG 

77CGAGACCAAGCCTGGGCAACA7CKK:GAAACC7CATCTC7ACAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAACCCAAAAA77AGCCGGGCA7GG7A 

G7GCACACCTG7GAGCCCCAGC7AC7CAGGAGGCTGAGG7GGCAAGACTG 

C7TGAGCCC7GAGGTCGAGGC7GCAG7GAGCCGAGA7CACACCAC7GCAC 

77CAGCC7GGGCAACAGAGTGAGACC77G7CCCAGA7AAA7AAA77AAA7 

TAA777AA77AGAGGA777AAGGA7777CCC7ACAGACACC7CC77A777 

7C7C7GGCC7TT7CTGAC7ACTC7CCC7AAC7CCC7GC7CC7CTGG7C7C 

CCAAAAC7AC7CCAGAAAAAAAAAAGGGGGGGAGGGAC7AAAGGAAAGCC 

AGG7GACAG7GCCAGTGTGACAGATGACAAAGCATC7GCCCGAACAAACC 

G7AGG7CCC7GAACTTTCTCCAAGACCTGTCTGTGGACTTACCTATGAAA 

ACCAG7777AGCAAAAACCCTCCTAAGCCAGTTTATCAAGA7CCCC7TAT 

CC7CAA7A7CCATCTGATTGGATTCTTCATCCCCCACCATTCCCCAGTGA 

TG7CACCAGGCC7T7C77CAGCAACAG7AG77AG7GGG7G7AGCCAGGAC 

GCCCCC7CACCCCTGATATGCCCTT7TAG7AAT7C7TCATCCACAGGTTC 

CCACCC7GC7CCTAGGCTATACATTCCCA7TTGCCCATGCTGCA7TCGGA 

A77GAGCCCAG77CTATACTGAGGTCTTACTTCACC7CTCGCCA7AGTCC 

7GAA7AAAA77GGTTTTCACAT7TAAAAACTG7CCAGC7C7GG7TGT7CC 

77GACACAGGGTAATTTTTATTCCATG7GATAG7TTGCCT7ACCTCAGCC 

7ACACCCC7CAAACCTGCAACTCTATAT7CAAGAACCAGACAGCCCTTTC 

CAACAGA7AGGAAGAGGCTGCCCTGGTGCAAAGGAAGAGGCTCTGGGAGG 

AAGGAGAGAACCCGAAGGCTGCCCCC7CC7CTAGAC7GAGC7C7GGGATG 

GG7GGACGA7AAAACCCAGA7ACG777AGACA7C7GAGCG7GGAGAGGAC 

777GC777GC77CCACAGGGACCCCAAGGAAACTGCAAGCCC7CCAGAGA 

C7AAAAACAGCAGAACAGCAAGAAA7GGCAGCAAAGG7C7GGGCAGAA7C 

A7CC7A7G7GGGCACAGACACAAACAGAG7CCCC7G7GGCCCCAGGAGAG 

777AAAGAAGATCCAGAGGC7G7CC7A77CCA7A7C7CAGCAGAGACAGG 

CCCG7GAGCC7AAAAGC7GATCA77AGGACAAGAAGGACACGAAC7G7CC 

7GCAGCG7GAACCGCG7GGAACAAGGCCAA7CACCAGACACCAGACCAGC 


FIG. 4 (28 of 61) 


WO 99/06426 


PCT/US98/16102 


CAGACACAGCCCCGCAG1TCCCCAAGACCACCACGGACCCATCGCCCCTC 

ACCAATAGCTCCAGGCTACATAGACCCCCTCCACTTCATGGATGTCCTCA 

GAGCAGAAAGGGGAGGCAGGAGTGGAACCCTGACTTGGTTCAGTTGAAAC 

ATAAAATGACTGTACTATTATTGAATTGCTGAAG7TTACGTGAAAGAAAT 

GAGATTTAGTTTTTGGCCACAGTGCAAAATAAGAAACGAGGCTTCAACTG 

AGATTAAGGTGAGTTATAGGAAAATGTACTCCCTTGAAGGACCTGTGAAG 

TGTGTTCGCTATGAGAAAATGACCAGAATCCACGTTCTTAGCTGCGGGAC 

TCAGGCTGACTCCTGTTTCTGGAGCTTGCACAAAGGGCAGGGAAATCCCT 

GTTTCAGGCACAGTGATTTCAATGTTTAAAAGAAAACAGGTGGGCCCTGG 

CAATCATGATAACATGTCATAAGTTTACATCTCTGTGAGGCAGGTAGTGT 

AATCCCCATTTTGCAAAGGAGGAAACCGAGGCTGAAAGCAGCTACATGGT 

CTCTTCAATGTGGCCCAAATGTTGGAGAACAGAGCTTAACTGAATCAGCA 

ATTCTATACTTAGAACTGACTCTCTCTTTATTATATCTCACTACTACCTT 

GATATTTGAAATATTCAA C TTTT TT CAATCAAAAAATAACAATAATTTAG 

GCATAATGACTACTATGTCATTTAATTTCTTGCTGATATTTCAATATCCC 

ATGCCAGGAATATTGAAAGCTCAGCTCCTTAAGAGCTGACTATGGCATCA 

ACTCCCAACAACCATCCTTCCAGAAATATTTTCCCCTTTCTTTTGTTATA 

GAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTGAAC 

ACTTGAAATTGGC7TGTCAGAATTGCAGTGTAAAGTGTAAAACACATACC 

AAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCATTAAG 

AATTTTTATATTGACTGTGTAAGTAACATTTTGAATATATTGGATtAAAT 

ACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACTTCA 

CATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCTATT 

AATGTC GTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGACATA 

TAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTGCTG 

CATGTTGTTTCACTTAAAGTCAGTGTCCAAGAACCTATCAGTGACATTTA 

GTGAGGACTTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTACTTA 

ATTCCTCTAAGCCTGGTTTATATCCCTGCAAAGAGAGAAGGATAATAATC 

ACCAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATGAAA 

TGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAGCTG 

CTATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAATGT 

GCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGACCAG 

GCTCATCTAAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATCAAG 

GACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGGCAC 

ACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAGCTG 

TGGGTGGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGTGGT 

TCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAGAGA 

GGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCTACT 

TTAGAAATATCCACATTTTTGGTAAATTCAAATGATCCAATGATTCCATT 

TCCTAATGTTCAAAACTA GCCCCA GAAACATCTAAATGAA TCAAA CAAAT 

AAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGCAAC 

ACACACACACACACACAGAACCCATAAGCCTTCATCTTTCCTTGGATAAA 

CGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTTCCA 

ACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTGGCC 

CTCCTGACTTATGATTTCCTGTGTGTCGGCeTGTTACCACTGCAGCTTAA 

CTGAGGAAACAAGAACAAAACAGCTTCTGACCCCAAGAGACTGTTGGAGG 

CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCCCAG 

CCCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGCCTT 

ATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTCAGG 

GCTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTCCCT 

GCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGTTAA 

GCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGTCCT 

CTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCACGC 

CCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGGGGC 

CCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGCGCC 

CTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTTCCT 

CCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCATAAA 

C7GAAAATAACAAACCCAGGGCTCTTTCTGTCACAGGAAAGGGTTTTTTT 

TTATAAAATTAAACAGAGATGATTCAACACACCCAGGATATAACACATGG 

GCCATGAATCAAGGGCAGCATTGCTCTGGTCAGCCTGTTGTTTGGGCCCC 
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CTTGGCAGGGCTCTCCCCiGAATCTTCCCCTCTTGACTCCCATCAN'CACA 

GCACTCCANCTTTGTGTTACAGGCGATAAATGGGAAAGGGGTAAAT 

>Ccncig48 

CATTCTTAATTAGAGAAACGCTCATTAAACTAGACACCCAAATTCTCTGG 

GGGGGGATCATTCTTACAAGCATGCCCTTCTCTCTTAAAGAGAGAGCACT 

TTTTTCGCAAATAATGCTGCCATGAACATACGGGGTGCATGTATCTTCGT 

AAXAGAATGATTTCTATTTTGGGGGGTATGTACCCAGCAATAGGATTGCT 

GGGT CAAATGGTATTTCTGGTTCTAGATCTTCGAGATCTTCCACAC CGTC 

TTCCACAATGGTTGAACTAATTCACATTCCTACCAACAGTGTGAAAGCAT 

TCCTATTTCTCTGCAACCTCGCCAGCACCTGTTATTTCTTGACTTTTTAA 

TAATCG7CATTCTGACTAGCATGAGAGACAGTATCTCGTTGAGGATTTGA 

TGTGCATTTTGCTAATGATCAGTGATGTTGAGCTTTTTTTCATATGTTTT 

TTGGCTGCAAGAATGTCTTCTTTTGAGAAGTGTCTGTTCATGTCCTTTGC 

CCACTTTTTAATGGGGGTTTGTTTTTTCTTGTAAATTTGTTTAAGCTCCT 

TATAGACTCACAATAACAAAGACATGGGATCAACCTAAATGTCCATCAAT 

GATATAACGGATAAAGAAAATGTGGTACATATATACCATGGAATAGTATG 

CAGCCATAAAAAAGAATGGGATCATATCCTTTGAAAGGACATGGATGAGC 

TGGAAACCATGATCCTCAGCAAACTATGCAAGAACAGAAAACAATTGTTG 

CATGCTCTCACTTATAAGTGGGAGCTGAACACTGAGAACACAGGGACACA 

GAGAGGGGAACAACACACATTTGGGGCCTGTCAGGGGTGAGGTGGGGGAG 

GGAGAGCATTAGGAAAAATAGCTAATGCATGCTGGGCTTAATACCTAGGT 

GATGGGTTGACAGGTGCAGCAAATCACTGTGGCACACATTTACCTATGTA 

ACAAACCTGCACATCCTGCACACGTACCCCAGGACTTCAAAATAAAGAGA 

GACAATAC7TCTCCCTTAAGTGTCTACTGTTGCTTTGCAATAAAAAGTTC 

CTGCCTTTCACTTCACTCTGACTTGTCCCTGAATTCTTTCTCGTGATGGT 

GTCAAGAACGTGGACACTGGCTGGGGCTGGAGACTCACCAGCATCCGGAG 

ACCCTCCTGAGCCCTCCAGCAATACAACTTTGACACAAACTATGAAATCA 

CAGATCCAAGAAGCTCAAAGAACCCAAGCACAGGAAACATGATGAAACTA 

CATGAAGGAACATCAGAATTGAATTGTTCAAAATCAGTGATAAAGAGTAA 

ATCTTAAAAGCAACCAGAACAAAATATCCATCATATACGCAGAAATAAAG 

ATAAGTATGACAGCAGATTTACAAATAGAAAAAAAAACAAGTGCAGCAAC 

AGAAACAAACTATCAATCCATAATTCTATACCTAGTGAAAATTTCTTTCA 

AAACAAAGGTGAAATAAAAAAATTATTTTCAGGAATACAAAAGCGAAAAA 

ATTAATCACTAGCATTCATCACTGCAAGAAATGTTAAAGGAAGTCCTTTA 

GGCAGAAAGAAAATGATACAAGGTGAATATTTGGATCCCTGCAAGGAACT 

AAAAAGATCCAGAACTGATAACTTAATGGGTAAACATGTAATTTTCATCA 

ACAAGTGAATGAATAAACAAATCATGATATATCCATATGATAGACTACTA 

CTTAGAATACAAAAGAAGAACTACTTATGCATGTGATAACATGAATGATA 

TTCAAAATTATTATTGAGTGAAAGACACCAGATCAAAAGAAAGTACATAC 

TGTATGATTCTGTTTATATAAAACTCTATAAATTGCATGCTCTTCTATAG 

TGACAGAAAGAAGATCAGTGGCTGCCTGCAGACAGGAAGAGATTACAAAC 

GGAAATGAGAATTCCTTAAGAGATGATGGACATGCTCATTACCCATCATA 

TGTATACAGCCATAATGGTTTTACAGATACATATATATGTACACGCCAAC 

ATAAATATAAGTTATCAAATTACAGTAAGTTCTGACTTAATGTCACTAGG 

TTCCTGGAAACTTTGACTTTAAGCAAAATGAXGT ACAGT GA AACCAA TTT 

TACCATAGGCTAATTGATATAAAGATGAGTTAGGTTTTTGGTTTTTTTTT 

TTTTGACATGAAGTCTCGCTCTATCGCCCAGGCAGGAGAAGAAGAGTTAG 

GTTTTACAGCATGTTTCTGGTCACAAGAACATCATCAAACTTGTAAATAA 

AGGCACAAAACACTTCTAATATTAAATATCAAAATAAATATGAGTTATAC 

AGAATTTAAGAAAGATTAATAAAAACAAGTAAAATCATTATTTATGGGAT 

TTTTGGTAATCAGTGAGTTATGTGGTCATAGTGGAAGTGGGTTAAGTCAA 

GAAATAAATGTTTGCAAAACAAAAATTTTAAAGATCCTCTCCTACCACCA 

CACAAAAAACAAGAAAACACGGTGGGCTCGCTAAGCACTTTTGTACCACT 

CGTATCTTATGCGTTrGTATGATTATTGTAAATGCTTTATGATAATTTTT 

AGAGACAGGGTCTCACTCTGTGTCTCAGGCTGGAGTGAAGTGGTGCAATC 

ATAGCTCACTGCAGTCTCAACCTCCCGGATTCAAGAGATCCTCCCACCTC 

AGCCTCCAGTGTAGCTAGGACTACAGTTGTGTGCCACCATGCCCATCTAT 

CTTCTTTTTTAT T TTTTGTAGAGACAGGGGTTGTGCTTTGTTGCCCAGGC 

TAG7CTTCAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAA 

ATGCTGGGATTTCGGACATGAGCCAGCAGCACCTTGCCCAGCATTTTATT 
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TCATAATAATTATAAGTCATTCCTTCATTCATCTTACAACCCACTTGTTC 

CAGTTCAGGATCTCGGGTGACCAGAACCTATTAACGTTCACGCACAAGTC 

AGAAACCAGCCCTGGACAGGACACCATCCTACCGCAGGGAGAACTTACAC 

ACCCACACTCACTCAGACTGGGACCATGCAAAGAACCTAACGTGCACTTT 

GG AATGTGTG7TC CATACC CACTAGAACAGCTAAAATTTAAAAGACTGAC 

CATACTTGAGTGTTGAACAGGATGTGACACAACTAAATCTTTTAAGCGC7 

TCTGCGTAAATGGCAC^GCCGCTTTGGAAAACAGTTGGCAGTTTTTCAAG 

TTAAATATACCCAAACTCTATGATCCACTTCTCAACAATCAAACAAGAGA 

AATAAAAGCAATGTCTACACAAAGATGTATACACAAATGTTCATTGCAGC 

CTTAATTATACTAGCCCCAAGTTGAAACAAGCCAAATGTCCATTACCAGA 

TGACTGGAACATACAAATTGTGGTATATTGATACAATGAAATACTACTTA 

GTAATAAAAAAGAAAGAGCTATTAACATAAGCAACAACATGGATGAATCT 

GAAAACAATTATGCTAAGTGAAAACAGCCACACAAAAGTTACATACTGTA 

TGATCACATCTACATAAAATTACAGAAAAGGCAAACTAATCTATAGACAG 

AAAAGCAGATGAGTGGTTACCTAGGGATGGGGCAGAAGGGACGAAAGGAT 

GGATTGCAAAATAGCACAAAAATATTGGAGGGATGACAAATffTATTCATT 

ATCTTGATTGTG GGGA TAGTTTAATGGGTATATATAGAGATCAAAGCTCA 

TCTAATTATACACTTTAAATATATGTATTTCATTGTGCATCAGTTATTCA 

TCAACAAGACTATAAAATAATATATGCCTACATACATTTTTAAATATTCA 

AAATCTCACAGTTATATACATAAATGCAACTGAATATGTATTCAGATGTT 

TTAACAAGCAGAAAGGACTGATTAAACTCATGACAGCGGCTGTTTCTGGG 

AAGGGTGTAGGAGACAAGAGATGGAAAAGAGGATGAGAGCCAGAAGAGAC 

CCTTGTAATGTTTCCTTTCTTT7AGTAAAAATATATTGACAGTTAAAGCT 

GAGAGGTGAGAATAATAGTCTCATGGCTTTTGTGTCCTTAAAATTTCACA 

AACTAAGTGAAATGGGAGAAAGCAAAAAAATAAACTTAAATAAATGTTAT 

ATTGCCCAAAAAGAGATTTAAAATGGAGGTTAGACACATGAGACTTACGT 

TCTCAAAAAAGTAGAATCTGCAGGGAAGTTTAACAACTATAAAGAATTAA 

AATCTAGCTTCTACCAGCCCAAAGCCTAAAATGTTCTGCTTTATTCTTCC 

TTATTATAATTCATAGGTAATATATTTTATGTTTGCAAATGAATGCAGTG 

ATATTAGATCTCTAAGAGGTGCTAAAAATGAAAAGTACATATTCCAATTT 

TTCCCAATTTTCCTTCTCTTTCCATGAATGAAAAATATACATATTTGATG 

ATTTCCAAGTTTATACAACCGATCTTTCTCrTAGTTTTCTCTTACCAAAT 

TCCCTCCCTCACTCAGCCACCAGCCAGTCCAACTGTGCTACCTGCACAGC 

AGCCCTCATACCATCCACACTCTCATCAGGATCCTGCCTGACCTGCGAGG 

AGCAGCAGCAAGAAGGAGACAGAACCTCCACGCTGAGCATCTCAGGGCTT 

T CT C A G AG ACTC CAGAGGAC C CTGAT AGGGACAGAGC CTGGC CAGCAAT C 

CATGCTGCCAGCTGTATGATTGTGGGCATGTAAATTCTCAACTGAAAATG 

GGTGTAATAATAACATGTTCTTCCCAGAATGAGCTTTATGAAGATCATAT 

AGCTGTTTGGAACTCAGACAAGCACTGGTAGGAATACAAACAGGGGAGCC 

AACAGCCTATAAATAATACTTTAAGAAAGGGCATGAATGTAATTACTTAG 

GAACAAAAGGCAAAGTGGAGAGATGCCTAGGACTGAGCTGGACAAGCTGG 

ACCCTTTAGTGGCTCAGCCCATGGGCTGACAAGGAAAATGGAGGAGCTAC 

CAAAGAAGGTGGAAGGATTCTGGGAGAGTGGCCCTCACCCTGCCCAGGGC 

AGGGCTCAGTGGGAGAGAGGGAGATCTGTTATAAATGCTGCCAGGAGGTC 

GAGTCATGTGAGAATGTCCATGTGAAAACATCCACTGTGTGTATCTAAAG 

AGAGTGGCTGTAAAACAGGTCAGGGTCAAAGGTCTTATTGTCTCAGATGT 

TATCTGCATGCATTGTCTCACGACCAAGAAAACTAAGGAGCATGGACACA 

AAGGGTTAGGTTGAAGCAAAAATTTAATAAGTGAAAGAAGAAGGCTCTC7 

GCAGTGGAGAGGGGAGTCTGAGTGGGTTGCCACTTTGACAGCTGAATCCA 

AAAGCTTTTATAAGAAACTCTTCTCATATCTGCAGCTGTTTGAGTAACTT 

CTCTTACCTATAAAACTGTCTGTATAACTCTCCCTTATCTATGCAGCTGT 

GGGATGTCTCCAGGTAAGCATAAAGTGTAGCTTCTCTTGTTTGTATAACT 

GTGGGTTTGTTTTAGGCAAGCCCCCATCCCCTCCCTGTGTAAGCTCCCAT 

GGAGCCCACCATGTGCATATCTGAGAAGTGGAGGAAGCTTTCTCTGGGAG 

CTCACTGATCGTACAAAGAACAAGAGGCTTCTGTGCCGCTTATCTATTCA 

GGTGCAGCCTGAGTTTTCCCCAGGCTGCTCTATTTTTGCCTGTAGCTATG 

ATTTTTCAGGCAGGCTGCTTCTCTGAAGACTAGCCTTAACTGTCTACCTA 

TCAGATTTTTCCTTTTCTTCTCCCTCAGCTGGTTCCCCTCACCAAGGCTG 

AGCAAGTGAAAAGGAGGGCACAGGGCAGGCCAGTAGTGAGCAGGAACAAG 

GAACTAAGACAGCAGAAACCACTCTTCACAC CTGGGTTQAAAGGGGTGGG 
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GAGCCAGGACTACAGC1 uAGGTAAGAACATAGGTAAAGAGATACTGTTGT 

TGTGTTGTTTTTAACTATGAGAAGCATTGAGCTTTAAATTTCTACAGGAA 

GGA7CCAG77CAGACAGGAGCACCCAA7A7TCAGAAGAGAAGAACA7GG7 

GTAAAGGTCCTGGGAAGGCTGAGAGGATTGGGACTCAGAATCCAGAGCAG 

AAGC CG7C7G7GAACAGAAGAAGGACCTCC C CCAG7GTAGCAAGAGGGAG 

GGAGGAGGGACAGATGCCAAGA7GGT7CAGGAAGAAGG7TTGGTGG7AAA 

7GJGAGGC7G7GC7CACCTGC7GGCTTCAATTTTC7CTTTAAAATG7CAG 

ATGGAATCATTTGATGAAGGCCATGCCATGCAATGAAATGGCAGTCTGAG 

GCATGGAGCAGCTCCAGCTTAGCCCGTGTTTAGGGTAATTATGGCTCCAA 

CCCAGGAGATGAATATGACTAGGGAAAGTGAAGTCCAAAAACAAATGGTC 

TCAAGTTGACTGTGAGTCTTCTGGGAGGCTGAGACGACAGGTGGGGTTGA 

CAAGGGAAGGGGAACCCACCTGCTGAAAAACATCAGGCTGTTGGCTGGGG 

GAGGGG7GAGGCCTGTGTTGTAGAGATGGATGGATGCCTAAAGTTGGGTA 

AAGGTTTCAACTCTACCCTCTGCTGGGTGTGGAAATAAACAAAGACCACC 

CAAATGAGAACAAACAAAGACTATTTATCCAGAGC7TGCTCTGACAAGGG 

AGTC GGCAAC CAT CACTTGCTTGGCAGAG ACTCAGAAGTAAGCAGGGGAG 

AAAGCC7CA7AGCAGAAAGAAGGGAAGTCT7CATGTATGCCCTGAGTGGC 

AGCTGTAGATGTGGGTGAGTTGCAGGTGGC7AACTAGAAATGGGGGACTC 

C7GTGTGATTGATTAGGAGCATGTTTGGCTTTCTCTGGTTGGTCCTACAT 

TGGAAGAGGGAACAAAAAATTTAGGGCAGTTGTCAGTTATTAATCAAGTG 

TTGGCCATTTTTGACTGACTGTTACAGGAGTGACTGGCTCCCTGGATTGT 

TTGCTAGAAATAGTGGTCTTCACTTCCTGCAAGTCTGACTTTCTGGTAAT 

AGGC77CC7GGG7TGGC7ATTGTGGA7AATAAGTGGGTTTCC7GAGCXGA 

777C7GCAGATTG7GGATCAGAGT7A7TTTA7ATAAACAGTCTGACCATT 

TTCCACTGGCATATTCCATCTTCCAAGAGCTGGCCAAGCTGCTGTCTTAT 

CTGTCTCCCCCAGCCCCTCCACTCTGGCTGTGAAAATACAAGCCACTAGG 

TGAGGAATGGGGACAATTGAAGACTGAAAGCTTTTCTTTGCTGGGTTCGC 

AGAGCTGAGGAAAGAAATGACAACATCCAAGTGTCTGCCCTGGGCCAGTT 

TTAGGACTGTAGTGGTAATGCAAGGACTGTGTGAGTTTATATTTTCATTT 

GTCTCTCTAACTAAGGTGGAAAAAAAAAAACAGAAAATTGTCTGTCTGCA 

GTCrCTGCAAAAGTCTAACACTGTGCTTCCCAACATTGCAGCCATTAGCC 

ACAGGTGAGTATCAAGCACTTTAAATGAGACTGGTCCAAACTGAGATGTG 

CTCTGAGAATAAAACACACAGCAGATTTCAAAGACCTAGTACATGCCCTG 

ATTTCAAGCTATATTACAAAGCTGTGGTAATCAAAACAGTATGGCATTGG 

GAAAAAAATAGACACATTGGTCAATGTGACAGAATAGAGAGCCCAGAAAT 

AAACCCGTGCATGTATAGTCAACTAATCTTTGACAAGAGTACCAAGAATA 

CACAATGGGGAAAGTCTCTTCAATAAGTGGTGTTGGGAAAACTAGATATC 

CACATGCAAAAGAAAGAAATTAGACCCTTGTATTACACAAAATCTAAAAT 

TAA77CAAAATAGAAAAAGACTTACA7GTAAGATCTAAAACCATAAAAC7 

CCTAGAAGAAAACATAGGGAAAGAGCTCCTTGACACTGGCATTAGCAGTA 

ATTTTTCAGATATAACATQUUUW3TACAGGCAATGAAAGCAAAAACAAGT 

GAGAGTATATCAAACTAAAAAGTTTCTGCACAGCATAAACAATCAACAGA 

GTAAAGACATGACGTATGGAATGAGAGAAAATATTGACATCTGACAAAGG 

GTTAATATCCAA AATATATA AGTAATTCACACAACTCAGTAACAAAAGCC 

AAATAACCTGACTTTTTTTTAAAATGGGCAAAGTACCTGAATAGGTATTC 

CTCAAAAGAAGACATACAAATGGCCAAGAGATGTATGAAAAGCTGCTTAA 

CATAACTAATCATCAGAGAAATACACAAATCAAAACAAGATATCATCTCA 

CACCTGTTAGAATGGCTATTATTAAAAAATGAGATAAGTGTTGGCCAGGT 

GTGGAGGAAAGGAAACCCTTGTACATTATTCATAGGAATGTAAATTAGTA 

CAGCCATTATGGAGAACAGTATGGAGATTCCCTAACAAAATTAAAAATAG 

AATTACCATATGACCCAGCAATTCCACTTCAAGGAATACATTCAAATACT 

ATCAGTATCTCAATAAGATACTTGCACTCCTATGTTCGTTGCAGCGTTAT 

TCACCATAGCCAAGATACAGAAACAAGTTAAATGTCCATCAACAGATAAA 

TGGATAAAGAAAATCAGGTACATATATATATACAATGGAATATTATTCAG 

CAAAATCCTGACATCTGAGATAACCTGGATAAACCTGGAGGACATTATGC 

TAAGTAAAATCAAAGCCTGACACAGAAAGACAAATACCACATAATCTCAC 

TTACATATGAAATATGAAAATGTTAATTTTATGGAAACAGAGTAGAATGG 

TAGTTGCCAGAGCCTGAGAGTAGAGAAAATGAGATGCTTGTCAAATCAAA 

TCATCACA7TGAATATATATAATCTATTTGTCAATTAAATATTTTAAGAA 

7AAAAAA7ACC7GGCACCAAAAAAAGAA7GCAAAATGTCTCAACAATGTT 
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ATATGTATTGCATTTTQ^AGTGATAATAATTTGAATATTAGGTTAAATaa 

AATATATTTGAAAAATTAACTTCACCTATTTCTrrcCATTTTTGTTAACA 

TAGGTACAAAAAAAAATTAAAATTACCTATGTGGCTCATGTAGGTGGCTC 

ACATTATACTTTGATGACACTATACAGGCTGGTGACCATATATCTCTTAG 

ACTAGTCTAAGTGATTTAACAGTGGTTCCAGAAAGATCCAGGTTTAACAC 

CAATGAAAGGGCCAGCTGGCTTAGCCCAGCTTGTGTGGGAAATGTTGGGG 

AGXGGTTTAAGACAGGGAAAAGCAAAACTTTTGATGCTATTGACTTTTTG 

AAAAATC7TTTGTGGCTGAAAAAACCAAAACATTATT 

>Concig4 9 

GCTCGAGTGTGTCTCTAAAGCCTTTCCCCCATTGGCTCCACTATACGCAC 

TCTCCTGGTTTCCTCCCCTCTAGCCGCTGTCTTTGGTCTCCTTTCTGATT 

TTGCTGCGTCCTCTGTCCCCTGAATGATTGCTTCTCCACTACGGGGTGAT 

TTTGCTCCCCAGGGGACATTTGGCAATATCTGGAGAGGTCTATGGTTGTG 

TTTGAGGGTGTTGCTACTGCCATCTAGTGGGGAGAGGCTAAAGATGCTGT 

TAATGCCCAGGACAGTCCCCATAACACAGAATTATTCAGCTCAAAATATC 

CATGGTGCCAAGATCAAGAAACCCTGCTCAAATATTAGCATGXGCTGAAG 

GCCCTTCTCTTTCCTTTAGCAATATCTGCCTCCTTAGGGATCTTTTCTAG 

TCTCAGTGGTTTAACATTTAAAATCCCAAATTAGGCAATAAATTGGGCCC 

CAAACTTCGTTAGTATAAAATGTAGAACTGTGTTATTAGAAGGCTAATAA 

AATGACCTGGTGAGCATCTGCAGCTAGCCTCTGAGCAATTCTGGGGACCA 

CGTGCAAGATAAATCCATCTGTTCCCTCTCTGTAATGTGGCGCTACCTTG 

TGGCCGATTTTTCCTCGGGTTAAATATCTCTGGGGATGCAACTTGTCGTG 

GTTAATGGCTGTGTGAGGCCAGCGCGTGGTGATAAAGGAATCAATCAAGA 

CAATATTGAATTTAGAAAGGCAGATTTATTTAGAGAAAAGGAGAGATACG 

TTGCAAGGGAGCAATGGGCAATACAGCAGAGGGAAGGCTGTCTGCAAAGA 

GGCAAGGGCTACGTATGACGTAGGGCTGCTTAGGCTGAATGCTTGCAGAC 

AAGATGCTTGCGTGCAGGTGGGCTGTGAGCTGAGTGCTtGGGTGCTAGTG 

AGCCA TTGGCAGCTGACCCTATTTCTTGGAACATT CGCT CCCT GCAA GCA 

riTTAATGTTAAACCGCCAGGTCAGTTTGAATTTrCTrriTTCTriTlTr 

TTTTTTTTTTTTGCCTTTAGTAGGACCTGCCGTTGTGAGACTATCTGAGG 

TAAATTAGACACCCTCCTGGTTTAAGTCACCGCTCCAGTGACTAGGCAGG 

GAGCTCTTCCTTGAAGAGGGTGTGGGC AGTG GGTACTTTGCATGTTGTCC 

ACACCAGGCGAGCTGCTGCTTCAGGGCCTTTGCATTTGCTCTTTTCTTTG 

CCCAAAATGCACTTCTCTCACTGTTCACATGATTTTTCTCCCTCTTTTCC 

TTTTAGTCTTTGCTTAAATATCACCTTCTAGGGAGGCCTTCCCACACCAC 

CTCTTCAAGATTTGAGGGTATGCACCCCCACCCCTAGCCTTCTTATCCCT 

CTCCACTGCTTTCTTCTCA AAGCA CTT GTTACG TTCAAATAAAATAGATT 

AGTTACTTTATAGTTCTAATTTTACTATTTTTTGTTTACTTCATCAATAC 

C CATGTAATCTCTGGAAGGAACGTTTCTTTTTGTAGTGTATTTCTAGCAC 

CTAGAACAGTACTTGGCACATGGCAGGTGTTCAAAAGTATTTGTTGATTA 

TTTTCTCAAAGGGCATGGAGTCTTAGAAGTTTGAGAACACAGTTCTAAGC 

ACAGCTGTTTAGAGACTATGGATGATGCTAATGGCTGTATTCCCAGTAGG 

TGGGGCAATTCTCAAATTGACCTGGAATCCTTGAGATCTGGGGACAGTCA 

CCAAGCACTGGGCTCTGTGGGGAGAGATGTGCTGGTTTTTAGAGAGGAGA 

ATAGCATCCTGGGGGACTTGGCCCCAGGGCTTTCCTGTCCCAATCTCTTC 

CCAACTGAGTCCCAGAGGCAGGAGGCCTTGTCTGTAGCTGGTCAGTCCTG 

TAACTGTTTCCCTCCCATCTACACAGATGCAAAGAAGGCTGAGAAAAGCA 

AGCTGTCAGGTGAGCAGGGGCCCTGACTCCTCCCCAGAAGGCACTCAGAA 

CTrCCATAGGGCAACrGGAAAGAAGGTTCTACTTCCTCACCGGCAGCTGT 

TGCTGGGGAAAAAACCAGCCTCAGGCCCTACCCTGTGCTGAGAACCTGAA 

TCCAGTATCAGGTTCTCCAACAAACTTGGATCCAGCTGACCCTCACAAGG 

GGTCAGATGCAACCTTGTAGCATATGGAAAATGGCAGCAAGGTCCTTGTG 

TGGACTATGCCTAGAATCTAAATTAAGACAAGGCCTCAGAGGGGCTAAGT 

GACATCTGTCTCCAAAGTTTCACAGCTAGTGTGTGACTAAATCTTGATTC 

CACCCTCTCAGGTTTTACCA TAAT CCCAAAAAAGGTTGAAACAAGAAAAG 

TTATCTTTGGGCAATTACCTCTTTCTGTTCCTTGCTTTACCTACTAATGT 

TCTAGGCTCACCCTCTGGTCTGCAATCTCACTGAACTGACAGATCCCTCA 

TGGCCTAAAGGGTTTTCACACTGGGTTGACTAGGCTCTCCGVTTGCCTGT 

CCTACTGTCTAAGGCACCTCCTGGGTAGGGTGCCCAGCGTCATTCTGATG 

CTGCCTGACTTTCCTTCCAGCTACTTTTGAAACTTGGTATCCATGGCAGA 
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GGCTTAAAGGGCATGTTC-JAGGTACTTTTATTTCG^AATTCCCCAGiTGGc 

ATCAAGGAAATCAGCATCTCTGGATAGCTCTACTAAGGCTTAGTTCTCAT 

TGTCCAATCTAGCTCCTGGGTCATGGGAGGCATTCAGGAAATATTTGAGT 

GTAAGAGTGAGTTGCTTTACCTCCAGAATATCCTTCCAATGGCTCTGAAG 

CAGGCTGTGGAGTCCTGCTGGCTGATCACAGTTCACAGGTGGCTCCCAAA 

CCTGTGGTCTACATCCATCCTTTGTCAGTGTCACTGCCATTGTCCCACAA 

AT^TCATTTGGGCCTAGCCCCTGGGATAGTAATCAGTCTTTACAtAGATA 

TACATTGTGCTTTACATCCACAGTAATTCTGAGTGGACCTTAAAATAAAT 

TCCATGTCAGGTCTCACCAGCCCATGGGTTACAGATGGGGTTACCTTTCA 

GCCTTGTAAGGTGCCCCGTCTTTGAGTGTAGACATGGACTCACAACGAGT 

CCACTCCTGCTGTTCCTCTGCTCTTGCTGAGGCTTCTGCTGCTGCTGCTG 

CTGCTTTGCAGAGGCTGGCCAGCTGTGGTGCCTGAGGCACCTGTGTCTTC 

ACAGCACCAACTTGCATGGTGGCCACGGTGTAGTTGGAAAGGGATGCTTA 

GATGGGAGGCCAATGGGAGCTGCTTCAGGAGGCAAATCCAAGTCACAGAG 

ATCGAGTCACCGAGAGCATAGTAAACTCAAAATCCCTTCTTCTGCTTAAT 

AACTGAGATGCTGTCACTGGGTTAACCTCACCAAGCCTTGrEPTGTCTTC 

ACTTAGAGTGATTTCTGTCTTAGAAGGCTCCTCATATCCTTCTGGGGAAG 

GCTTCTAGTGAGTCCACAGATAGCTGGACCAGGCATGTCCAGAAATAATC 

TGATTCTCACATTTGAGTTAGCCAGCGTTCCCAGCTATATCCCCATTTTG 

TGTCTATATAAGTTACCAAAGCCCACAAGGATATTAGGTGGCTCCTTAGT 

TTGCTTTATGATTATGCCTTGTGTGTGTGTGTGTGTGAGTGTGTACGCCT 

ATGAGGATTCCTTCTCTCCCGTTCTTGCTATGGCTTCTCTTCCCCACTGA 

TGGGCTGTAGTTCCCTGTCCTTTTGACTTTGGGCTTAGTCATGTGACTTT 

TTTGCCAAGGGAATGTGGGCAGAAGTAACTGGGAGCCAGTCCCAAGCTAA 

GGCCTTGGGAAGCATGGTGAGCCTATGCCAGCTCCCTCAGAACTCCTTCC 

CTTGGCCATGAAGAGAGAATAACCTGGATTGTACCTTCAGCCCATGTCCT 

AGAATACAAACATGGAGAATAATGAACTTGACTCAAAGGCTGAAGGGCAG 

CTGAGCCCACATGAGGTCAATTGAACTGCAGCTACCTACAGACCTGAAAG 

TGAAATAAACATGTATAAGTCTCTGACGTTTGGGGTTTGTTTACATAGCA 

TTATTGTAGCAGAAACTTAAATAATACTGGGGGCTAAATATAGTGGACCA 

GTGACAGCACAGAATGGTAAAATGGAGTGATTGTTACTTACATCACAACC 

CTTCATCTCTGTTGATGGACACTAAAATCAAAGTGGCAATTACTCAGAGT 

TGGGAGTCATTGAGTTGCATCATTGTTGTTTAGAATCATTGACAGTTTGA 

GCTCTAAGTGATTACAGAGATGGTTTCCTCAGCTACAGGTAAATAAACAA 

AGGCACAGAGAAGTAAAGTGACTTCTAGAGGGCTTCATTGATATTTAGCA 

GCAGAATCAGAGCTAAACAATGAGTCTCTCATCTCCAGCCTTTCTATTCT 

TGTTTCCTAGGTTGGGATTTTGGGAAATAGTGCAGAGAGATTAGCAGTAG 

TGACATGGAACAATGTGAGCCTCAGCTTCCATCCCTGAGGCTGCCTTCAT 

CTGCCAGGGAAATGTCTCTGTGTGCAGCCTTGCCCTCTGCACACAGTGTG 

TATGGCCACCTGAATAAGTGTCCTTTCATAGCGACTAATGGATTGAAATG 

GGTGCTAGAGCAGTGCTTCTAAAAACTCCATGTATTAATCATCTAGGGGT 

CTTACCAAAAACGCATGCAGATTCTGATrCAGTAGGTCTGGAGTGGGGCT 

TGACATTCTGCACTTGTAACACATGGACCACACTTTGAGTAGCAATGTAT 

TAGATCATTCCAGTGGAAACATGTATGAGTGATGGAATGAACAGATATAA 

TTAATCCAGGTCTGGTAAGTGAGGTACTGATACATATTAAGTTGAAGTGA 

ATTTCACATCAAAAATAATGGTTACACAGTGACTTTTACTGCCCCCAAAT 

TCTTTCCTTTTGAGTGGTTTCAAAGTGAACTGAGCCA 

CCTG GTTTA GTGTGTGATTAGAAGATTTGATCCAGCTTTCTCCTCCTTCT 

AATTCTTTAAATATGCAATGGCCTTCTAGAAACTTGTCTCTCAGGCTCCC 

CATGAGCCACCTGTCTTAATATCTTCCCCCCCAGGACATTTCCTGGGTCA 

AGG AAGGAA TCAGGGACTAGGAAAAGTAGAAAGGTTGCCTGACAGTGAGA 

AACTTTTTGCACTCCTATTTGTTCAATTCTAAAATGTGGGTATTGTTGGG 

GCTTCTAATTGGAATCTAACCTGAAATTCAGGCATGTCTAGCTATATATG 

ACCAAGAATTAGGATGAGTTCACTAGAAGCCTATTTTCAGGAGAGCGGTC 

AGTTAAATTGAAGTTTATGGGTTTATGGTAATGGGTTGGGGAGTTTACTT 

CATTAGCAATAGCAACGTITIU'GAATCAGAGAAGTGATTTTGAACACACT 

GTACZATAGTTTTCTCACTTAGATTTATCTCTGGGTCAACCCTTGTTGGAC 

CTATATTAGAATCATTTAGTGAAGAAAAGGTGGGTGTCATTAGGAAAAGA 

GCCATTTATTCAAATGTTCTGTTTGACATTAGGGCACTGGCAAGACTACA 

GAATCAATAGATATTTAAAAACAGCCAGGTGCGGTGGCTCACGCCTGTAA 
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T C C CAGC G7GATTTGGC *rTACTTTGGGAGGCTGAAGCGGGTGGRTTG-,- 
TGAGCTCAGGAATTCAAGACCAGCCTGGTCAACACGGTGAAACCCTATCT 
CTACTAAAATACAAAAAATTAGCCGGGCATGGTGGCAGGCGCCTATAATC 
Z CAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCG 
GATG7TGTCATGAGCTGAGATCGCGCCATTGCACTCAAGCCAGGGCAAGA 
ATAACAAGACTCTGTCTCACAACAAACAAGCGAACATACGAAACAAACGT 
AACATC CAAACTAGCAGGTACATGCCGTGCCAGTCATGACC CATGGTCAT 
AAGATGTCTACAGCTCAGGAAGCAGCTGCACAATGCCTGCATAGACAAAC 
TCTTATGAAAGCAGAATGTCCTGATGTCTCCATAACACATAACAGTGTAT 
GC7TTTATTATGGTCATACTCTAGCTGTGATGTACCTACGCTCTAATATG 
CCAACGATAGTTTTCTTTAAATCATCAACATAATAAATGTCATGCTGTCA 
GTCCCCCACATGTAGACATAACTTAGCTGGTACATGGATAAGAAACCTAT 
ATTAGATAACCTTAGGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCA 
CTTTGGGGAGGCCGAAGCGGGTGGATCACGAGGTCAGGAGATCGAGACCA 
CCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAA 
TTAACCGGGCATGGTGGCAGGCACCTGTGGTCCCAGCTACTCAGGAAGCT 
GAGGCGGGAGAATGGCGTGAACCCAGGAGGCGGAGGTTGCAGTAAGCCGA 
GATCACACCACTGCACTCCAGCCTGGGGGACAGAGCGCAAGATTTCGTCT 
CCCAACCCAAAAANCNANNNNAAATTTGCACCCAAATCTGACTAATTCCA 
GAGCCAATTCCAATTTAGAATCGTTATATCTCCCTGGTGAACTGAAGCTT 
TTATCTTTAAGGAGACACACTCTTTATOTCTACCAATGCTTATTGCCTTA 
AAGTCCACTTTGTCAGATACAGCTGCTTTCTTTTAATTAGTTTTTGTGTG 
GTATATCTCTTT CCATC CTTl'TTCTTTCAGCCTTCTCCATTCTTACATTT 
TAGATATATTTCTTTTTTCTTTTTTTTTTGAGAGAGAGTCTCACTCTCTC 

gcccaggctggagtagtgcaatggcgcgatcttagctcactgcaacctcc 
acctcctgggttcaagcaattctcctgcctcagcctcccaagtagctggg 
attacaggagcccaccaccaagcccagctaatttgttgtatttttagaag 
agatgaggtttcgccatgttggccaggctggtctcgaactcctgacctca 
ggtcatccacccacctcggctttcccaaagtgttggtattacaggcgcga 
gccaccatgcccagctgattttagctgtatctcaaaaacagcatgggttc 
tgtttgctttccttattcagctttataatgtaaatcatttacatcaaaca 
tctaatac^ccatggactgtaaaacacack:catattttatgtatgaatta 
aaaaaaaaaacaccaccaattagttcctgagacacacaccttaacaatat 
ct ctgt gatgtgcataaatcaatcacatcagtttctctg cacct caaaat 

TTCTTTCCTCAATTCTCAGAGATATGGCAATTTCTCTGGTTTTACATTCC 

CAGAAGCAAAGAAAAAGTACACAGCTTCTTCAAGTCATGAGTAGCTTCTT 

TTTTATAGCTCTTGGTGTTTGCAAAAAAGATTGGAATTGCTTCACTAATA 

CTAAATTTTCATTCTGCTGCTCTGTTTCTATGACAAGTCAGAGGGCATCT 

TTTTGAAGACATTCTAAACAGCAATTAAACTCAAAACATGTAATGACAAT 

GACACACAAAACTCAACTGATGACCAAATGAAGAGTTCCAGCCAAGTTGA 

CACAAGCTGGCTGACAGAGCTTGTAATACACACAGCTTGGCATATGCCTC 

GCCATTTCZAGAGATGTAAAAATAGGAATAAATGTTTTCCCTTAAATCAAT 

GAAATAGAGCATTTOGACTGAAAATCTACGACA GTTA T AGTC 

TCATTATTCTCATTCrGTTTCTTCTCCCCCTTGCTTTCTTTTAGTTTGAA 

TATTTTCTATCATTTCATTTT'rClTCCTACTAGTTTGAAACTTATGCATT 

TATTTTCTATTTTTTAGCACTTACCTAAAATTACTCTGTAATCCATGGAT 

CCTTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCC 

CATCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCC 

AACTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTT7TAAAA 

CTTTCTAAAATAG CATTATTTT ATAGCATGAGTGTTCATTTACATTTTTG 

CATATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACT 

CCCCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGC 

ACTATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAGTTTAAAC 

TAAGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCC 

ACATTTCAAGTGCTCAGCAGCCACATGTGACTAATGACTACTGTACAGCA 

AACATATAGAACATTTCCATGATGGCAAAGAGCTCTATTGATAGTGTTCA 

TC CAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTC 

ATGGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTAT 

TTCTGCAACCAGTTACAGGGAGAAGGCCTGGAAATCATCACCAGGCCAAC 

TGAAAATTATGACGTTTTCCAGAGCTTATATACCTTCTAAGCTATATGTC 
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TACGTGTAAGTGTGCATlJACCTGAAGACGTAAGTGATTAACTTCTtTlH 

ATCTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTA 

AATTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCT 

TGTC C C CTGCTAAATCATGGAGGTTTGGGGAATTC C CTTTAGAGCAC CAT 

TAACCTGTTTGTT.GAAGGCCTGGGAATTTCCTCCAAACCCCCATTAAACC 

TGTTTAATCCCAAATTGGTTCCGTTAAAAATTCCCTCCTTAATTTGTCCA 

ATJT7AAAGGCCCAAAAAAGGCTGGGGCAAACTCCTGAATGGCCTTTGTT 

ACATTCCAACCTTTGTTTAAAAACACCGGTTTTTAATATTTAACTTAACC 

ATTTAATCTCTACTGAAACACTTGTTATATAAATCTGCATTAATGAGAAC 

TGGCCTGCGCCATATCTCCTTCTCAGAATATC7TAGGGTTGTGATCCCCT 

GTGTGAAGAGAATATATCTCTGGAGATCTCAATCTCTCTACCCCAAAAAA 

AATCTCACTCGGAGAAAACTCAGACTCTTATCTCCACAGCGCTATCTCTC 

TCCTCTCC 
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GCTTGTCTAAGATGGTGCTCCTTGTTGCTGTGCCTGCTTTCATCCTGGGA 

rCTCCCTTCACCATCAGGATTGCCTTCACCTCATTCCAGTCTTGGATCTT 

TCTTCTTGTTTCTTGAGTATTTT TT TTTTT TTTT TGCTGCATTCCCTTCA 

GTGGCCTCTTGGGAAAAGATGTGTAGGGAGAAAAATTTTCTTTAGAAACT 

TGCATATCTGACAATATATTTATCCTATCCTGACATTTGGTAGATAGTTC 

AGCTGGGTACAGAATTCTAATTAATTTTCCTTCCTGATTTATAAGACATT 

GCTCCATTTTCTTCTGGCTTCGAATATTGCTGCTGAGAAGTCTGACACCA 

TTCAAATGCCTGATTTTTTCCATGTGATTGTTGTTTTCTGTCTGGAGTGT 

TGTAGGATTGCCTCTTTATCTACAGTGTTC7GAAATTTCATGACGTAGGT 

CTTTCTTCATTCATTATGGTAGACACTCAGTGGGCCATTTAATCGGGAAA 

AACATGTGTTCTTCAAGTTCTACAAACTTTATTACTTCCTTTTTCTTGTG 

TCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCTTAGATCTGTCCTCTAA 

TATTCCTATTGACTTTACTTCATTTTCTAAGTCTTTATCCTTTTGCTTTA 

CTTTCC GAGA GACCTGCT TAACC TTATCTCCCAACTCTTTTATTGAATTT 

CATTTCTTTTACTATATATTTTTTACTTTGAATACACCTCTCTCTTCCTC 

ACATTTTCCCCCATAGTATTTTGTCTTCAATTGACAGTTCTACTATCTTA 

TTACTCTGGAGATATTAATAATAGTTT rr AAA riTlT ATTTATTTTTATT 

TTCAAAACAGTGTCTTACTCTGTCACTCAGGCTGGAGTGCAGTGGTGTGA 

TCATGGATCACTGCAGCCTTGATCTCTGAGCTCAAGCTATCCTCCTGCTT 

CAGCCTCCCAAGTAGCTGGAACCACAGGCATGTGTCACCATACCCAGCTA 

ATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGTAGCCCGGTCTGGAGTG 

CAGTGGTGCAATCTGGGCTCACAGCAACCTCTGCCTCCTGGGTCCTGGTT 

CAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGAAACA 

CACTACCATGCCCAGCTA A TTTTTGTATTTTTGTAGAGACAGGTTTCACC 

ATGTTGGGCAGCCTGGGTCTGAACTCCTGACTTGTGATCTGCCCACTTGG 

GCTCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCACCCGGCCACT 

AATTTTTAAATTGTTAATAAAGACGACK3TCTTGCTATGTTGCCCAGTATG 

GTCTTGAACTCGTGGGCTTAAGTAATCTTCTGCCTCAGCCTCCCAAAGTG 

TTGGGATTACAGGTG TGAGCCA CTG AATCT GA C AliriT rA AAAG TTTT C 

TTCTCTTTACCAAGTCTTlVrrCCCCTlTCrGCrriTlTGGGTTGlTrrA 

TTTTGATCTCTATCTTGCTAGAAACTTTCTGGAGACGTTTAGTAATACTA 

GATTTTTGAGAGTGGGCAACTGGAAAGCTGATTGGAAACTCTGAATACAT 

GGGTGAGGCTTGTTGGCTGTGAGTGTCATTGCTTGATGTCCTGGCAAGGC 

CAATGGGTTTGGGACCCCTACTATTAGTATAGGCCTGATTCCCTGGGAAA 

GGCTCTTTTGATCTCCTGCCTGGAGGATAAAGGCCTGGCTACCAGCCTTC 

TGTGTGTAATGTGAGGGAG AAGGG CTGGAGTATTCAACATCATGCTGAAT 

CCTTTCAATGATCATCTTGTTTTTAGTAATCTCCTACCTTAACTCTCTGT 

CTT CTGCTAGTATGGGAAAGATGACCTGAAAATCTAACCATTTATTTTTC 

CCCCATTAATATCATTTTATGATTATTCAGAAGTTAAATAATTGTCATGC 

TGTCCTCCAAAAAGACTGAATCAACTAGCAACAAATAAGAATTTTCTCAC 

AGCTCTGCCAGCATTTTAAAAGAATAGCTTTATTGAGCCCAGGAGGTCAA 

GGCTGCAGTGAGCTGTGATTACACCACTCTACCCCAGCCTGGGTGACAGA 

GCAAAACCCTGTCTCAAAAAAGAAATTTAAGGAACAGCTTTATTGTTGTA 

AAATAGACATACAATAAACAGAGCACATATTTAAATTGTGCAACTTATAC 

TTTGATATAACCCTGTGAAAACATCACCACAATCAAGATAGTGAATATAT 

TTATCACCTCCTGATACAGTTTAGCTCTGTGTCCCCACCTAAGTCTCATG 
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TTGAATTGTAATCCCCAATGCTGGGGGAGGGGCTTTGTGGGAGGTGATTG 

AATTGTGGGGGTGCACTTCCCCCTTGCTGTTCTTGAGATAGTGAATGAGC 

TCTCATGAGCTCCCCTTCACTCACTCTCTTTCCTGCTGCCATGTGAGGAT 

GTGCTTGCCTCTTCTTTGCCCTTCTGCCATGATGTGTTTCCTGAGTCCTC 

CCTAACCATGCCTCCTGTACAGCTTGCAGAACTGTGAGTCAGTTAAATCT 

CTTTTCTTCATAAATTACCCAGTCTCAGGTGGCTCTTTATAGCAGTGTGA 

AAAGGAACTAATATACCTCCTAAGT7AC CTCAAG.CTT CTTCTTAATTC CT 

TCTC CT CC CTTC CTTCATTGC CAAGCAAACAACCACCTGTTTTCTGTCAC 

TATAGATTAGTTTACATTTTGTGGGTTTTTTTTTTTTTTGAGACAAGGTC 

TCACTCTGTTGCCCAGGATGGAGTGCAGTGGTGCGATCATAGCTCATTGC 

AGCCTTGAACTCCTAGTTTCAAGTGGTCCTCCCACTTCAGCCTCCTGAGT 

ACCTGGGACTACAGGGGTACACCACCACAACTGGCTTAAAAAATTTTTTA 

AATAAAAATGGGGTCTTGTTATGTTTCTCAGGCTGGTCTCGAACTCCTCG 

CCTCAAGCAGCCCTCCCTCCTTGGCCTCCCAAATTGTTGGGATTACAGGC 

ATGAGTCATGACTCCTGGCCTAGTTTACATTTTCTAGAGTTTTGTATAAA 

TGGAAACATACAGAATGTAT1"1"I"1 " 1"1'GCGGAGTGGGGGAGTGTTTCTATT 

TCTTTCTTTCTTTTTTCTTTTTTTTTTTTTTT'l'TrTGAGACGGAGTCTCG 

CTCTGTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCG 

CAAGCTCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGACTACAGGCGCCCGCCACCACACCTGGCTAATTTTTTTTGTA 

TTTTTGGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCC3ATCT 

CCTGACCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACA 

GGCGTGAGCCACCGTGCCCGGCCCAAGTGTTTCTATTTCTTAACCAGCTT 

TCATGCAATCTTTITTTATTTTACCATCTCTGTGATCCCACTCCCAAAGG 

TACTAGATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCT 

TTCCAGCCTTCCAAAAATTTTTTTCTTTTTTTCTTAAAGATACTCCTGTG 

TGAGGCTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGT 

GAATGCCAGGGAATTGCCTGATTGATCAAAGAAATGTATCCCCTTCTCCC 

TCACTCTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGA 

ATTTAAATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAA 

GTACAGAATTAGGTGCAAGAGTGCATAGCTGTTATTTTTTTTTTGGCCTC 

TGAGACTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACC 

TGAGATGTCAGGGGGGTCTGATAGAGTACGTTTGAAGGCAGTTACTGGAA 

AAAAATAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAA 

TATATTGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCTTT 

CTTCTTCCTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAA 

CTTCTTAGGAGGAGAGCTTGCCAGCCCGTGGGTAATGGAGAGGTCTTGCA 

GAGATAAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGGTAAATG 

GGGATACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAGAGA 

AGGGAGGGGAAAGAGGAAGGGAGTCAAAGATGAATTGCTGAATACGGGGA 

TTCCAGGGCCTGGAGCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAG 

GGCATCAGCTGATGAGGAGCAGCCTGAAGTCCGGGGAGGACCTGTTTTTG 

GTGGCCAGGAAGAAAGTGCCTTCCACACACAGGGAGGCCACAAGGCTGAT 

GGGCTGGGGGTTGGAAGGACAGCCCTAGGACAGGCTTGGGAAGCAGGCTC 

AGGTAGGGACTGCGAOTTTCTTGTTGAGTCTTTTTCATTCCTGGTCTTAG 

AAAATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGG 

CAGATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCT 

CTCAGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAG 

ACAGTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTC 

AATGAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAA 

TATTCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACT 

TCCTGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGA 

CAAAGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCA 

GTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGC 

AGCAGAACCAGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACT 

CAGTCTCTAAGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGA 

AATAATTCATTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGA 

CTCTCTTTACCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATGTCAC 

TGTCAAAAAAGAAGAACCAGTGGGTATATTGTATGCT7GAGTTCCAGCCA 

TTTGTCACAATAGATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGA 
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CTGTGTGCTAAACCCQAcCTGCCACTTCCAAGGAGTAGATGAGGAATGaC 

CA7GG77C7GGGGAGCCC7ACCCCAA777GGGGCAGACA77CCAAAGC7C 

A7777C7G7GGAGGGGG77GA7GG77AAAGGACGGCC7GGGAG7AAC7CG 

7C7G7AC7AGGGCCCAGGAGAG77ACA7GC7GC77CCCA7G77A77CA7C 

ATTC C C C CATGTGAATAGCTATGGC GTGAGGTCCAAGGTTAGGGCCTTTC 

TACCATAAATGGGGGAATAAAATTCCCCTACCAGCCTGAGAAGTTTCTGT 

TAJAAAGAGGCTTTTT'rrrTGCGGGGGTGGGGGAGCAAGCCGACTAATGT 

GTTATTTCCATACGGTTTGTTTTAAAATGTAGATGTCATATGCAGGAGAG 

GTGGTGTAGTGAGTCACAACGGGATTAGAAGGACCAGTCCGAAAAGCAGA 

AGAGGGTCAAGTTCAGGGCACTGAGGACTACTGCATTCAGTGGCGTGAAA 

GGCAGATGGCTGAACAGGAGGGGGACATTACATTGCTTGTTCTCCTTGAG 

CCTCGATT7CCTCATCTAAAAAGAGGGTCATTTATTCACAGAACATTTAT 

7AAAC77G7GCCAGGCACCG7GCCAGGAGC7GGAC7AAAAA77AAA7CCA 

CCCCTGTGAGCTGCTCTGAAGGCTAAAATATGAAGTATGTAAAAGTAACC 

AAGTGCTGTACACA TGCA GCTATTCAATGACTGTGTGGGCATTGCGGCAG 

ATTTTAATTTTCTTTTTTATTTCTTTCTCTTTAGTGAGAGGIGTTGGTTG 

TTATTATTGTCGTCGCTGTAACTGTC7ATTTCACTTG C TTTTTTGTTGCC 

TCCAGCCCATTCCAGGGCTGTCATCTAAGACACTTCTTATCACCTAAATA 

ACCGGGGAGGCAAAGCGCTTTCTTAAGAGATGGATCCAGAAGAACAATGC 

TGGTTTTCTGTAGAAAAAGGGGCTGTGGGAAGTAGAGATAAGAAGGGAAT 

TGGCCAAGATGAATGTACAGAGCCTTATTTTTTTTTTATAACACAGCAAG 

ATTAGATACAAAACAGGACAATAGCATCATCTGTTTTTATAACTGGAAAG 

GACC7CAC777ACAGG7GGGGAAGAA7AGAG7GGAGAAG7GAAGAGAA7G 

3TCACAGAGTCAATCAGCATGTCTGCGTCAAAGCTGGGATTCCCAATTCA 

GGGCTCTTACTACAGTGACGTATGGCTAATATTTTGGCATTGTTTCGGGG 

AAAAGCTGAAGCCC7GATGGTGTACGTCACTCTTGAGATAGTCTGTAGTC 

CAGCAGGGAGGAAAGCAAGGAAGGGAGG7GGAGGCAGCA77777GGG7G7 

AACA777CG77C77G7777G7GGCCAAA7CA7AG7G7GA77GGGACAAGC 

CAC7GCCT77C7C7GAGCC7CCAC777C77777C77C77AAGAGGGAGGG 

AATAG7AGAG7AAAAG7AG7CA7T77A7CAAACACC7GC7A77TTGGAGC 

CATA77GCAAG7GGG77GGGGG77GAACAC77GGC777A77ACCCA7AGG 

A77AAA7CCAACC7CGA7AC7G7GGCA77CCCAAAC7CCAG7C7AA7C77 

C77C7CCA7CAGCCA7GCCCCACGACACCC7GG7CA7A7C7GA7G77GCC 

CCTTGCAC77GCCCCC7CC7TA7C777GC777C7GACC7ACCA7A7GGC7 

A77GG77GAAA77C7CA7777CCAGGGCC77GC77AAA7A7CA7C7CA7C 

CA77AAAAC777C77GAACC7CCCC77GCCC7G77CC7CCC7AA7G7C7C 

AAGC C AGAA777A777CC7777G7GGCCAAGGGAC7GGG777G7GACC7C 

7C7CACGAGAC77AA7A77GAGACCAAACG7C777AGACC7CACCAGCCA 

3AGAGA7GAGCA7C7A7GGAA7GCAGGC7777GCC7GGAC77GC7GA7GC 

AGGGCC7C7GCC77CC7CCAGGGCC7C7CC7GC7G7777AGGAA7T7CCC 

TCA7GGCACAG7CCA7GAGC7CAGGG7CAAGT7CA7ACA7G77777 A C77 

C77C7AC7C7GCAAA7GG7CT7C7TGAAC7CTGAGGGTCC7AAAGCTGCT 

C7GCAG777G7GGGG7GAG7AGAAAGGGGC7T7CAAAAGT7G7GC7G77G 

777CCCACCCCAA7AGCA7GAAACACAAAGA7GC77ACAAA7AGC7GCC7 

TGC777C7AG7CCCAAC77C7C7C7CC7GAGGC777AAAACAAG7CCCC7 

AGGT7GAGC7GGAC7GGAGTTGTA7CC7A7CT7CA77ATC7G7C7AC7C7 

C777C7GC7C7C7AGAGAAGATA77A7A7A7G7G7G7A7G7A7G7G7AAA 

7A7A7AA7A7CCA7A7A7AGAACA7A7A7TG77A7A77TACA7A7ACA7A 

CA7AACA7A7GCA7G7A77CA7A7A7ACA7A7G7AG7A7CAAAG77GGAA 

T7AAAC7G7A7A7777G7AA777GC7777A777GCA7C7A7CAC7G7AAA 

A7GAA7 A777A7 CCA7AC CG7 AAGA7A77C77CAA7G7 A77777777777 

777GAAACAGGG7Cr7GC7TTG77GCCCAGGC7GGAG7GCAA7GACCCGA 

7C77GGG7CAC7GCAGCC77GACC7CCCCGGCTCAAG7GA7C77CCCACC 

77AGCCC7C7GAG7AGC7GGGAC7AAAGG7G7G7GCC7CCACACCCAGC7 

7777AA77777777G7ATTITr77T77AAAGACAGGG7777GCCACA7TG 

CCCAAGC7GG7C77GAGC7CC7GGG7CCAAGCAA7CC7CCCAC777GGCC 

7CCCAAAG7GC7AAGA77ACAAGCA7GAGCCACCACACC7GGCC7CAA7G 

7AA77777AA7GGC7G7A7AG7A77CCA7CA7G7GG77G7 ACCC AAAA77 

A777AAC CAGTCCCCAG777A777CAA777777777AC7A7777GAA7AA 

7G7777AG7AAA7ACCCACAAAA7A7G7ACAA7GGC7GGGC77AG7GGC7 
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CACCCCTGTAATCCCAAv^CTTTGGGAGTCTGAGGCAGGTGGGTCACCt'G 

AGG7CAGGAG77CGAGACCA7C77GG77AACA7GG7GAAACCCCG7C7C7 

ACCAAAAATACAAAAATTAGCCGGGTGTGG7GGCACACACCTG7AATCGC 

AGC7AC77GGGAGGC7GAAG7AGGAAAA7CAC77GAACC7AGGAGGCGGA 

3GT7 GCAG7GAGC C GAGA7CACAC7AC7G7AC7C CAGCA7GGGCAACAG7 

GAGAC7CCA7C7CAAAAAAAAAAAAAAAAAAAAAAAAG7ACAA777G77G 

TACC7CC77GA77A777CT777AAG7AGAA7777C77A7AA77777777A 

7AAG7AAAA7777GAA7CAAGGGAGAAGCACC7GGAG7CC77CAGA7ACC 

TA77GCCAAAC7GAAC7777C7G77CCAGG777AC7ACA77CAGCC7GAC 

7CAGGG777GGGGAG7AGAGGAGGGGG7GGAGGCAGAGGGCC7C7CCC7G 

7CCCCACAGACC7CCC77GG7GAGG7CCAAG7C7GGACAGG7GGAG7G7G 

GCA77GCACCG7CAGG7CC7GC77CC7G7AA77CCCC7AAA7CCA7CCAG 

TGGAGCC7CA77G77CAAGTC77Tr r rTrrmTl T 77T7777 , r AAC7CCC 

C7GAAGACGGAG7C7CACTC7G7CGCCCAGGC7GGAG7GCAG7GGCACGA 

7C77GAC7CA777CAACC7C7GCC7CCCAGG77CAAG7AA77C7CC7GCC 

7CAGCC7CC7GAG7AGC7GGCAC7ACAGGCG7G7ACCA7CACGCCCGGC7 

AA777777777G7A77777AG7AGAGACGGGG777CACCA7G77GGCCAG 

GC7GG7C7CGAAC7CC7AACC77G7GA7C7ACCCGCC7C7GCC7CCCAAA 

G7GC7GGGC77ACAGG7G7GAGCCACCAGGCC7GGCC7CAAG7C7A7777 

77AAC7CCAGGAGGCC7GG7A77CAGAGGGA77AGGGC7GGCAGAAGGGC 

C7CAAAGC777CAAGGCCTGGGGAA7AGGC7GCAGCC7GG77CAGGG7AA 

CCCAAG7GA7777GG77CCAAAGGGACAGGAAAAAAAG7GA77GA7A7GG 

AAG77G7CAAAG7GCAAC7G7CAAGACA77AAAAAA7G7AACCC7777AC 

7AA7A7ACAG7AGAC77G7G77AAA7A777AAC7GA77G7AAAAGGAAAA 

AACCAGACGCAG7777CCC7ACCA7AC7G7CACAACACC7CAACAC7GAG 

77C77C7G7GACC7C7AG7CACCGAAA7GC77GGGGA777C7CCCACCAC 

7AG7CC7CCAGCAGCCGACACCAG77GGG7G7CC7AA77CAC7CCAACAC 

7ATCTACC7GGAGTTAGCGTTAGATCCCACAGGTTGAGGGCTCAGTCTCA 

CAAGAC7GCC7CCCAC77CAGG7GCCAG77ACAAG7GG7AGG77G7CACC 

7A7GC77C7GAC7GA7GGC7A7AAA7C7GGG777GC77CCC7CGGG77CC 

G7GAA777GC7AGAGCAGC7CACAGAAC7CAGGAAAACAC77AAG777AC 

CAG777A77C7AAAAGA7A77ACAAAGGA7ACAGA7GAACACCAGA7GAA 

GAGA7GCGCAGAGCAAAGCA7G7GAGAAGGGG7G7GGAGC77CCA7GCCC 

C7C7GGGGCACCACCC7CCAGGAACC77CA7G7G7CCAGCTA7C7GGGAG 

C C C77CCAAAC CC7G7CC777T7GGG777TTAAGAG7GGC777A77ACA7 

A CAC A7G AT7GAC CGAAC CA77GGC CA77GG7GAC7GACACAAC C77CAG 

CCCC7CCAC7CCC7CCAG7GG77GGGGAG7GGGGC7AACAG7C7CAAG7C 

7CCAA7CC7GCCTTGGTCTTTCCTGTGACAAACCCCATCATGAAGCTAC7 

GCATTGGGGC7GCCAGCCAGCAGTCATCTATTAGCATGCAAAAGACACTC 

TTATTATTCCAGAGA77CCAAGGG77777AAAAGC7G7A7G7CAGGAAAC 

AGGAGA7GAAGAACAAA7A7A7A777CACAACA7CACAC7CG77GGGGGA 

A77GACAGGA7AGCAAAACTGAT7AAAGGAGGA7AGGAGAGAC7GAGATA 

7A7A777CCA7A7A7A7A7A7AGAGAGAGAGAGAGA7A77TCCA7ATATA 

TA7A7AGA7C7AGAGAGAGAGAGAGA7AGAGAGAGAAGAGTC777CC 
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ACACA777GGGGGAGCAG7TCCGGAGG7ACAGCCCGGACAGGAGA7G7GA 
GAAGA7CG7GG77AOTGT7CCCC7GG7CCAGAACCCC7CCAAGTGGGCTT 
AAG7AGGAAGGG7GG7GAGCGGCAGG7AAACACACG7CAAAGGCAG7C77 
CC7C7C7GAGGGAAAACACT7GTA7AAGCA77GCAA7CAA7GGGCC7C77 
7AA77A7G7GCCAG7GGCAAGAGCGGG7GC7GAACCCAGGGGCC7GCC7C 
AA7CCGGGGCC777GAGGCAGAA7AAAG7GG7C7CAGG77G77GGCA777 
CC77GCCC77CCACCCGAAGCAGACACAAA7CC7C7C7GGAGGCAAG7TC 
CCCAA77CAGCCAG7ACAACTCCCACAGAC7AAGA7CAA7CA7G7ACAAG 
C7CACAGACAAAGG7CACCAAACACACAGAGCAA7AAACAAA77CA7GAG 
7GACG7GAA7GAGAA7AAACAGAAACAA7AACCACCAGC7GGGA7GC7C7 
AAG7C77CAGC7G77AGAA77CC7GAA7A7AGAA7AAAAC7GCCACAA7G 
GCAAACA7GCA7C7AG7AC77AC7G7G7GC7GGG77C7AAGAA7777GCA 
CA77G7GCCAGA7ACCGACTCAGC77CACAC7CACCC7CC7AC7G7GCCC 
7C77AA777GCAC7AGA77AAAAGG7AGAAAGGAAGAGGCAGC7A77C7G 
77C77GGC7G7GCC7C7GGCAGCACA7GCAAAA7GGGCAG7AACAG7GGC 
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AG7CACAGG7AAG7AGC ITCTCACAGa ^ rGGAGTTAAAGGCATGGGAv. J 

G AGAC GAG CAAGGTT C CT AAAGGGACAGTGGC CAGT AAATG AC CAGGGGC 

TAC7GGAG7GGC7GCA7GGCTC7G7GGAAGC7CAGAGGAGCCT7GGG7CC 

TGCAGGTGC AGTAG CAGCTTTCTGTAGTTCCTGATCTCTGGGTCCCACAA 

TCTTCCCCGTTTTTGCTCCTCCACTTCTAATTTTGTAACTGACTTCCCTG 

TG7G7AC77C7C7CTC7GA7TGAAA7AGCCAGAC7GG777C7G7T7CC7G 

ATAAGACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTC 

TATGAAGGAGGTACAATAATTATTCCTATTTTACAGATGAGGAAACACAG 

CAGAAAAATAAAGTCAATTGTCTAAGGTTGCACATTTAGTCAAGGGAAGG 

GTTGATATAACATATAATTATTTAGAAAACATCTAAGGAAATAAAAGGCA 

TAATTTAAAAATAAAACTAGGCAGGTTTAAAAAAATGAAGTAATCTATAA 

GTAAAAAAGTATAATTGTTGAAATACATATCTTAGTGGATGGGTTAAATA 

GCTGAAGAAATGATTAATGAACTGGAAGGTAGTTCTGAGGAAATCAGAAT 

TCAGCATAGATAGAAAAAATGGGAATTTACAAAAGTACACAGGAATTATA 

AAAGAGG7TAAATTATAGGGAGGGTAGAATGAGAATTAACATTGGTCTAA 

CTGGAATTTTGGAAGA AGAG AATAGAGAGAATGAACAAGGCAATATTTAA 

AGAGG7GGCTGAGAATTTTTCAGAACCAACACAAACTATGACTTTACCAG 

TAGAGAAAACAATGTACACTGAGGAGGATAAATAAATATACTATGAACAA 

ATTGTAATAATAATACTCAACAAAGACAAAGAGAAGATCTTAAAATCAGC 

AAAAAAAGAAAGTCAGACTTAGAAAGAAATGACAATGGCAGACTACTCAA 

CAACAACAATGGAAACCAAATTCAGTGAAACAGTATTTTCAAAATGCATA 

777AA7C7A7C777GAAGAATAAGGG7GAAAAGGG7GAAAA77GC7GCC7 

TATACAAAATATCAACATTAACAAAAAGTAATGAAGGTAATATAAAAATG 

TTTTCAAATAAACAAAACTGAGAGAGTTTACCACCAACAAGCATTCATTA 

AATGGACTTTTAAATGCAGTTTTTAGGAAGAAGGAAAACAATTCCTAAGG 

AAGGTCTGAGATGCAAAAAGGAATTATGAACAAAGAAATTGTTAAAATTA 

TAGGTGAATTAAAAAAACTGCCTGCATAAATGATAATAATGACAATGATG 

CTATTAATAATGAGTTGATAAGGATAAAGAAAAGGACAGAATTAAAATAC 

TAGAAAACAAGCATGCTGGAAAGGATTCAGGAATTACTTGAAGGTTAAAG 

TTCTAGGGTCCTTCTATCCTTCTAGAGGGGAGTCAATATATTAATTrTTG 

ACCGTCACTTACACAGTGAAAAACTTTAAGGATAACCATAAAAAAATAGA 

AATAGAGAGTATAACTTCTGAAACAGTCAAGGGAAAAATATGGAATAAGA 

AAACTGACCAAAAAACATCTCAGTCAATCAAAAAAAAAAAAAAAAAGAAA 

GAAAAGGTTCGGAAGGAGAAAATCAAAGCATAGAAAAAGCGGGACAAATA 

GAAGTGGAAAAGAAAAAGGTAGAAGAAACAGGTCCAGAAATATCACTGAT 

GCACTAAATCACCATTAAAAGATGAAAACAAATGAACAACATCAAAAAAT 

TCTAGTGACTGTAGTAGTGCTGATCAGAATAGGCTCTAAGATAAGATGCA 

TTATTGTGAGTCAACTTGTGATGATGAAAGGTTTAATTCACCAGAAAGAC 

ACAA77A7AAACTTGTAATCAAATAGTTTTATT77ATTTACTTTATTTA7 

T7A77777777GAGACAGGATC7TG77C7G7TGC7CAGGC7GGAG7GCAG 

7GGCT7GATCTCAGCTCACTGCAGCCTCCACCTCTTGAGGCTCAAGCTTT 

C77CC7GCC77AGCCTCATGAG7AGC7GGG7CCACAGGCACACACCACCA 

AGCCC7GC7AATTT7TGTATTTTTTG7AGAGA7GGGG7TTCACCA7GTTA 

CCAGGC7GG7CTCAAACTCCTGGGC7CAAGCGA7CTGCCCCCC7CGGCT7 

CCCAAAG7G7TGGGATTATAGGCG7GAGCCACGG7GCCTGGCC7CAAATA 

AC7ATTTAAG7GAAACAAAACTAG7A7GGCAC7AA7GAAAAA7G7ATAAA 

7CCA7AA7CGCAGAGGGATTTCAACT7ACT7CTTTCGATTA7GTAAAGGT 

CAAACAGACAAAAGACAATGACAAAACT7AATGCAATGAACACTTTTGA7 

77AA7GAACATATATTGGATATGTACCCAAGAA77AGAGAA7ACA7ACTA 

G7777GAGTTTATGCAGAACAT7TACAAAAAT77AGTGGAAGCCTAAATT 

A7AAAAAGT7GCTG7CACGTAGAA7AACACACAAACCCCTGAGTCCGGAA 

77CAAAGCCCTCCACAC7CTCC7C7ACC7T7GCA7CT7TATCCTCCACCA 

CAC7GCAG7GCATAC7CTGGGC7AC7ACTCAC7GT7C7TGAT7CAAA7TC 

CA7G77C7G7CAGCTCAAATCA7TCTCTC7GCC7GGAATAACTACTTCA7 

ACA7A77CTGC7A7TGAATTCT7G7C7TAGCACCCCA7CTAC7CCAAGAC 

GA7GTCCAGT7GGGG77ACTCCC7G7CCCA7777C7TTGA77ACAC77TT 

T7T77C7AC77CCA77A7A77A77GA7CACA7C7G7GCCACAG77T77GA 

C7T7G7G7C7GCTT7TAC7C7T7TC7AGACCC7GAGAGCTCC7GAAGGG7 

7GGG7CA777C7T777TA7T7GCTCA77CC7CA7GGCACAG7GAG7GC77 

AA7AAA7GGC7AT7GAC7GAAA7TAAAC7G7A7C7AAA7GGACA7A7TCC 
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ACTTCTGGGCCATTCA? iCTTTCTTTCTATTGGAACCAGGAGATGGGGAA 
CCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACATGGAACCTTCCCC 
TGAGGGCCAAAAAAGAGCAGGGAAAGGTGCAAAGACAAAATCTTCCATTT 
TTAAACAAT GTAA GAATGTGGTCCACCTCATGCTCAGGTGGGAC7TTATC 
ATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTACCCCATATACAT 
TTACCTTTAGTGTAGGGAACTGAGGACAGGAATTTTGTTGATGCAGACTC 
TTGCTAATGAGGCTAACACTTGGAGAATTTTTATCATGCATTCAAGAAGC 
TTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGTTTAGCTTTAGTT 
GTAGGCTTATCAGATATTTGGAGATATCTTCATAAACGATGGCTTTGGTT 
TTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATAATCAAACAGCAT 
GGCCATTTGTTTTGTAAGGCCTTTCCTAAAATATGACGGTAAAATCTACG 
TGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGTGAATCTAGTTTG 
TCTTCAAAATGAAATCAAGTGATTAAAATGTAGTTTTCTAAGAAGATAAA 
TGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATCACTCATCCCTCA 
TAAAACTGTCCCAACTGATCCTGACTCACATGAATGAATTAAAATAAGAG 
TTAATAACATCAATTTACATTTTTAAAGACACTTTCCCATGXTTTAGACT 
ATTGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAGAGTTGGCTGTTT 
TTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCTAATTTTGTTGCA 
GAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAGTGTGGAGAAGTA 
C GG A G C C ATTT ACTGAAAGGCTGGGGGGAAATG AC G AGAC C CTGAGATAA 
GGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTTGAGATATGTTCA 
GAGTAGAATCAGAATGGACATAGTGAACAACTGGATGCAGGTGGGGGCTG 
AGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGTTGATCCACT.GAA 
GTTACATTATTCAACACCACAAGGAAACTAGGGGAATGAGAAGGCATACT 
GGTTTGCTTTGGAGTGGAAGGGCAGTGATGTAAGAGGAGTTAATGAGTTA 
AAGTTTGGATATGCCTGAACTTCAATTTGATATGTGCATCTGATATACCC 
TTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTATTTCTTAGTAACT 
GATAGGCATCACAGACTCACATCAGTAAGGAAGCAAcCAGCAAACTTGATT 
GGACGATATACCTGGAACTCAGTACCCTATGACTGGAGCAAGTCTCTGTC 
AGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAATATGTTGTTAGG 
AATATATGTGATGAACAACATAGGATACTTCCTACAGGGCTCCACATGTA 
GTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTGTAATTTATTTCC 
AAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAATTAATAACAAAT 
AGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCC 
AAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACCAGCCTGGGTGAC 
ACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAATTAACCAGATGTG 
GTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTGAGGTGGGAGGAT 
TGCTTGAGCC CATGAGGTTGAGGCTGCAGTGAGCCATAATTGTGCCACCA 
CACTCCAGACTGGGTGACAGAGTGAGACCCTATCTCAAATAAATAAATAA 
ATAAATAAATAAATAAGTACAAACCAGCAAACACTAATCCTTTCTAGAGA 
. TTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGCAGAGGGACCTAT 
GGAGATCAGC CTGGCCCTGGACAGCACCAGGCAATGGGGTTGCTAGAGAG 
GTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCAAGAATCCGTGAA 
GACTCAGACTAATTTTTTTTTTTTTGCATGAGGATTAGGTGTTCCTAGGA 
ATT7CAATGAGAGCAGGGTTAATGAAGGAATGCAGGGTAGGAGAGCTGAG 
GGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGCGTCAGTATGGCT 
CACCTGCTTTCCTTGTATCTACTTAGCAGATGATCCCACCCCAGGCCTCC 
AGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGAGGGCCTGGAGCA 
GTGTAAGGAAGACAGAGTCTTAAGAAATTGCATTAACAGTCATGGTGCTT 
GGCAAGTGTCGTCATCCTATGCCAAGCCTGATCTGAAGGGGTGCATGCTC 
ATAGGTAGCTGCTGCCCAAGATTACAGCAGCTTCTTCAATCCCAGATCCA 
TGCTCTCCTATATTCATTTTTCCAGGGGTTCCTGTCCTTCGACAGTGATG 
AGATGCAGAATGACTTATTGAGTTATTCTCCTGATAGTTGCCAACTTTTC 
CAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATGAGGCCCTAGGGA 
TAGCGTGCTTAGGAAAACACTCCCAGCCTGATGTAATTCTGGGGGTACAA 
TGGCATTTTCATCATCAAGACTGATGTAAAGGGTGACTAGCAGTGAGTTG 
GGGGTGACTCGCACTGGGGCTAGGTTTCTGATTCTGCCTAATCCAGACAG 
AGCAGAAGCACTAGTGGGCTGGTAGAGGGCCTCCAGGGCCTCACTTAATG 
TCCTGGAAAAACAGCTCCAGATTGTTGGTTCACGTTCTGAGGACAAGCTT 
GGGTACTACAGGATAGAGAGAGTGGTGGGAGATGCCGTGGCCTGCCCTGC 
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TGATGCCTGCCCTGCCATI'CCTGCGTGTGATGTCTCTGGGGCATCTTGCC 

TTCCCTGCCCAGACCTGTAGTTCAGCTGAGGGCATGTGGAGGCCAAATGG 

CTTCTTAGAGTGTTACTTTCCTTGAACAGCTCTGCTGGGAGAACTGGAGG 

AGCTAGCTAGTCACGGTAACTGCAGCAGTCAAAGGATCGTCCCGG7GGAG 

GTGGGGTGGAAAGGTAGAGAAAGAGAACATATAGCGTTTTCCTTGGAGAT 

GTGTGGGCATGTCATAGAGGAAATACCCAATTCCTGAGCCTTGAGCCCTC 

CAGGAAACCTTGGAATATTAGGTTAGTCATCCCCAAGGAAGTCTAAGAAT 

TCTGGTCTCACCCATCTCCTTTAATTCCCACAATGATCCTACATGATATT 

AAGG AACACGGG C CAGTAACC CTCCAAGCAATGGATGTGGTGGTGAAGTT 

TGACCTCATGATGGAGCGGAGGTTGGTTTGAAACCTAAGAATTTAATTTA 

TTGTTTCAAACTGTTCTCCACTCAGCGTTATTAAAGCATACATAATTGAC 

ACATAAAAATTGTATATGTCTACGGTGTACAATGTGATGTTTCGATCTAT 

GTATACATTGTGAAATGATTACAACAAGCTAAATAACATACCCATTCATC 

GTGTTTCAAAGGAATTAAACTCAAGCACAAAAGAGAGGTGCTGTTGAAGA 

GTAGGGCTGCTCTATCTAAGTAGTATGTCTGGGGTTGTCCTGGATCAGGG 

TCCTTTTGTGCTAGTAATAAACCAGCCCTTCTGGGGCTGCTC€ACTTTCC 

CCACATTTTCTTCTGGAGCCTCCCTAAGAATTAGGACATGGCCACTTTCT 

CTGCATAGGCTTCCTACTTCAACAAGGACAGGGCTTGTGCTGCCCCATGC 

CACTTGAGTGTCCCTACAGCACAGAGCTGAGTGCACACTGGCTGAGTGAG 

GAAATCCCCCAGATTAATCTTGGTTCTAAGCATCATGGCTGTATTTCACA 

CGTATATGAATTACAAATTACAGCATAGTCGAATAAGGATTTTTGfGCTA 

CAACTGGAATCCCAGATTATGCAAATTGGATAGTATAATATTGAAATTCC 

TAGGACTTTTTATTAGTTTTAAAAAATTATACAAGCTTAGAGTAAGAAAT 

TAAACAGTGCAAAAGAATTCACTGTGAAAAGTAAAATGCTCTGTCTCTGC 

TGAGAGACAGATATTGCAGCCOVGATACTACTGGGG7CAATAGTTTCCTT 

TAAGCATGCCATTTTGATGGTTTATGGGACTTACAGCTCAAGAAGCTTGA 

CACTAGGGTTGATCTCAGAAAATCATTGTTGCAGGTATTAGATATGACCG 

TCTCATAAAGATACACACACAGACACAGCGATTGGAGATATTCACTGGGG 

CTTATGGGCTGCTTGTCCTTTCTGCTCTGTGCCTAAGTTGGGCTCAGAGT 

AGCCTGGCATCGGCTGTGGGGAGAATGCTGGCATGGGGTTAGCAGGAGCC 

CACTTAACATGTCCTAAGCCACCTGGAAGAGTCCTTCAAGGAGACCAGAC 

TCCAGAGGCCCTAAGGAAGGAAGGACTTTTGCCCGTTTTTAGGTATTCTA 

GTCCCAGAGTTTAGGGAGGAATGGTTTGGCTTTGGGTCGTGTGCCCCTTT 

ACCGAGTGGGATGGGATGTGCCCATGAGCTGTTGAGCTGGCTCTTGGAGA 

AGACAGCAAAAGCGGGAATAAGAGGTCAGGAAGCTGTGTGGTTGTAGGAA 

ATCCCAGCAGAGGGCCTGGGGGTCAAAAGTGGTCATGGTAGTGACGGTGG 

AGGCTGAGGTGGTAGAAAATCAGAGGACAAACCCCATGGGCTGCTGGTGA 

rCTGACCGAGCTCCTATGCTCTCCTGGTTCATTTTAGGCTCTGTAGCAGC 

AGATGATTGGCTGGTGTGAGAGCAGTGCACCTGCCATATCAGGCAATCCA 

AGACAAGTCCAAGCTACGCTGGGAGGAAACCTGAAGGCAGCAGCAGGTAG 

ACTGGCTGAAGACAGACAGGCAGGCAACTTGTOUVTCAGATTTGTGTTTT 

TAAGGACTTTTAACTGGGGAGCCCTCCATGACAGATCAGATGAGAGAGGA 

ATCTGGGTCCGCCCATGTGTCAAGCTACCAGAGGGTCCCATCGGTGCTTG 

GATCTT CTTTGAAGCTGGGTCTGAGGTTTGCAGGTAGAGGGTGAGCTGGT 

CAGAGGGACCTATTGCAGAGCTAACCAACACCTTCCCAGGAATGCAAGCA 

CAAGCACCCCACCGCGGGCAGGCGGGCAGGCACTTCTCCTTTTGCCACCA 

GGAC CTCACAGAGGCTGATCTGGCTCTGTGAGGTGGGAAAATGGGTTGTA 

CTTAGTACATAGAGATAAAAGGCTTAGGAGGCCCCTCCATCCTGTGACCC 

TGTCCCCAGACCACAGGTGCCGGCAGGTGCTGCTATTTCAAGGCTGGGCC 

TCAGTGCAAGCTTGTGGTTTCTTGCCCACCTGTGATGTCCTCCCACTAAT 

GAAGGGGCTCTCCATCCTCTGTCTGCCTCTAGCAAGTGGAGGCTCTGGGC 

CCTGGGCAAGACACAGGGGGAAATGCCATCTGTTATCCAAATATATTTCA 

ATGTGACAGGAAAGCTGTCTTTAGAGCACAGC 

>Concig52 

GCATGTGCTCTACATTGATCCCAGGAGTTTGAGACAACATTGCAAGACTG 
GGCAACAAGCAAGACTCTGTCTCTACAAAAAATAAAAAAAATTAGTTGGG 
CATGGTGGTACATGCCTGTGGTCCCAGCTACTCCTAAGTTGAAGAGGGAG 
AATTGCTTGAGGCCAGGAGTTCAAGGCTGCAGTGAGCTATGATCACACCA 
CTGCACTCTANCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAATAA 
TCGTAATACATTTTTTTTAAAGTAAAACAAAAAAAGGTCACACTTTCTCA 
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TACCAAAATAAATTCCAAATAAATTAAAGGCTTAAACATGAGAAAGTTAA 

ACCATAAAATTACTAGAAGAAAATAAAAGCAAATATTTAGATAATCCTGG 

3GATAAATTTCTTTGGAATGAATTTCCTTAAGATGAATCTCTAAAAGTGA 

AATTCAGGG7TCAAAGGTCTTTTCTTTGTCCTTTTCTTTTCCCTTTCCCT 

CTCCCTTTTTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCT 

TTCTTTC7TTATCTTTCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTTCC 

TG2TTGC7TGCTTTCTTTCTTTCCTTCCTTCCTTCTTTCTCTCCCTTTCT 

TTCTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTT 

CTTTCTTTCTTTCTACTTTCTTTCTTCTCTTTCCTTCCTTCCATCTTTCT 

TTCTTTCTTTCTTTCTTTTCTTTCTCTCTTTCTCTCTCTCTCTCTCTCTT 

ICTTTTTTTTTCTGGTGAGACAGGGTCTCATTCTGTCACTCAGACTGGAG 

AACAGTCGCATGAACATGGCTCACAGCAGCCTTGACCTCCTGGGTTCAAG 

CAATTCTCCTGCCTCAGTCTCTCAAGTAGCTGAGACCACAGGCACCCACC 

ACCAAACCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCAC 

ATTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCTGCCTGCCTT 

GGCCTTCTGAAGTGCTGGGATTACAGGCTGGGCCTCTACGCGCGGCCGAG 

ACTACCTCTCTTTTAACTGGATCTCTGAGCTCTGGGCAGAGCCCACCCTG 

AATCCTGGTCTCCAAAAAGGGAAAATTATTAGGAGGCTAGACCATATGAT 

GCTTTTACAGTGCACTTAAAAAAAAGTTTGTTTTTTTTTTAAAAGACATT 

TCTACATGTCTAAACTACAATCTTCCTTGAAAACCCAAGAGTAGCTTCTG 

TTGCAATAGCTAGTCAAAAATATAATAGTCAAAAAAATCAGGTAACACAA 

CACAAACGCAAGCAGTTTAAGAGCTGAAATGAACTTGTCTG7TTACACTC 

TAGGGATTCCATAAGGAAAAATAGAAGTTTCTCCCTAAAAGGGAGCCTGG 

CACCTTCTCCATTTTCTTTAAGGAACCCCAGGCTATTATAAACTATTTTA 

GGGCTCTCATGCAGCAGACGGTGCAAGAGAAAGGAGAGACAGCAGAAGTA 

AATGAAGAAAACAGAATCCAGTCAACAGAGAAGAAAAAAACTTTTGCTCA 

AAAAAAGGCAAGTTCCTAGGAAAGAAAAAAAAAACATGAGGGCTATTTAA 

ATACAAAGACGCATACATACACATGCACACATCTTGGATGTTAGCTTTTA 

ATTAAGCTGACTTTTAACTATTGAGGTCCTTTAAAATAAATCTTTTAAAA 

TCTTATTACGATATTTCAGCTAGGACAAATTGCTGCTATTTCAGCATTAC 

CAAGTATCAAACCAGAAAAGGCTTGATTTAGGAACCAAACCCAGGCTGTC 

GTGGTAGGAAAAAAGGCAGAACGTTAGCTATGGAACCCACAGCATGGGGC 

AACAGCCATTGCTCTTTCAGTATGGCCTGGCTAGCAAAAAGGTGGCCTTG 

TTATGTAAATAAAGCCCGTTTGGTGGTCAAAATGAAACATCTTTTCCTTT 

TTTTTTTTCTTTTGCTGGCCGTTTTTTCCCCCACCATACCACGTTTGTGT 

GTGTGGGAGGGTGGGAATTTAGCCACTTCAGAGGCCTCATTCCCCATAAT 

TTGGAAA7TTCCTTTGGATTTGATCAAGTCAGATAGAGTAGGTCAAACCC 

AATGGGAAAAAGACTGAAACAGCAATAAAAACAGAAACAAACAGTTAAGC 

AAAATGAATGATCACACAACTTATATGATTACTGAGTGCTCTAATGGTAA 

GGAGAAATTAAGACCAGCTGGTTGTTAAACTTTAGCCAAGACAAAACCCC 

AATTCAGCTACTTACCTAGGGTTGGGTCTCAGGCTGAAGACCGCTCACTA 

CCGTTCTAGAAGCAAGAAATAAAACTTGAACTCGTCTTACCTGTGTAGCA 

GGACAAGCCGCAGAOVAAATCCCTCAGACACCAAATTAAAGAAGGAAGGG 

CTTTATTGGGCCTGGAGCTGCGGCAAGACTCACGTCTCCAACAACCGAGC 

TCCCCGAGTGTGCAATTCCTGTCCCTTTTAAGGGCTCACAACTCTAAGGC 

GGTCCACATGAGAGAGTCGTGATAGATTGAGCAAGCAGGGGGTATGTGAC 

T GGGGG C T G CATGCAC CTGTAGTT AGAATGGAACAGAACATGACAGGGAT 

CTTCACAGTGCTTTTCTTATGCAAATAACCGATTAGATCAGGGGTCGATC 

TTTACCAGGCCCAGGGTGTGTCACCGGGCTGTCTGCTTGTGGATTTCATT 

TCTGCCTTrTAGTTATTACTTCTTTCTTTGGAGGCAGAAATTGGGCATAA 

GACAATATGAGGGGTGGTCTCCTCTCTTACCTGCGGGGAGTGAGCTCAAA 

CTCCTTAAAGGAGTTACCTGCCTTCCATCATCAGGGAAGCAGGAAATCTT 

GCCTTCCTTGTTGGAAGCAAGTAAAACTCAAAACAAACAAA GAAA AAAAC 

AGGGAGTTGTACAGCAAAATAAACTTTTGATTTTGACCAAATTTTGGGAG 

ATCAGGAATTCTCTGAAGGAGATGCTTTCAGACCTCAGCAAATTGTCCTG 

T7GGTTTGAGCCATAAAGTTAGCTCATGCTGGTACCAAACACCAGTAGGA 

GATT7GTCAAAGGTAAGAGGCATCTCCACTCAGAATCCCTTCGTGGTTAC 

CAACATGTGAACCTTGGAAATCTGAGACAGGTCTCAGTTAATT7AGAAAG 

TTTAT777GCCACGGTTGAGGACACCCACCCATGACAGAGCATCAGGAGG 

TCC7GACCACATGTGCTCAGGGTGGTCTGAGCACAGCTTGGT7TTACACA 
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TTTTAGGGAGACATGAGACATCAGTGAATATATGTAAGATGTACACTGGT 

TCCCTCCAGAAAGGCAGAACAACTTGAAGCAGGGAGGGAGCTTCCAGGTC 

ACAGGTAGGTGAGAGACAAACAATTGCATTCTTCTGAGTGTCTGATTAGC 

CTTTCCAAAGGAGGCAATCAGATATGCAT77ATCACAGTGAGCAGAGGGG 

T GAG TTTGAAT AGAATGGGAGGCAGGTTTGC C CTAAGCAGTTCC CAGCTT 

GACTTTTCCCTTTAGCTTAGTGATTTGGAGGCCCCAAGATTTATTTTCCT 

TCTACATCAC7GTGGGCAGCTGACTAGGAAAGC77TGTAGGACTGGTGGG 

CAGTGTGAGAGCCCAGTGGGGGGTGGTGGTCCTGTGCCAATGGTAGCAAC 

CACCTGTGAGGCTGAGTAAACTCATTTCCCAACCTCCTCTAGCAGCCCCA 

GTGGAGATACAGATGAAGCAGACTAGCGATACAACCCAGCCTGAAGTTTT 

GTCTGGTGAGTGTAATGGAATAAAAATGGGAAGGGTGCTGAAGAGACCAG 

CAAGAAAATGGTTGAAGAGATGGGGCACAGAAATTAAGCTGGATCAAAAA 

GGACGGAAAAGCAGAAAGGGCCGATAGAGAGAGGGGATATCTATGGGTTC 

GCGATTCTGAAAAGGACAAATCACTGGTGCTTTGAGAAGAGAGAGGGTGA 

GAAAGCAGGAAGGCTGGAGGCTGTCATCCAAGAGGCGGACATCTGTGAAC 

ATGATTCCAAGAGTCACCAGACCATGGGGGTGGCCAAAGGGAGTGCCTCT 

TCTCACCTCCTACTCTTAATTCCTTGTACTCAAGATAATAAGTTCCCAGA 

AGAGAAGTACCCATATTTAATTCATCTGTGTCTTCCTAGCAGTACTAAAA 

ATATTATATGAAAGGTATCAAACCTTTGAGAATGTGTGCTGCTAAATTGT 

TAAGCSATGCTGGAAAACTCAAGACGTCCCTGATCCTGAGCCTGAGTATGA 

GCCTGTGG7GAGCCCAATGCAGGTCTCCATTCAGACAAAGGCCTCAGGGA 

ACGGATGAGACCTAGGGACAGAGATGCATGCTGGAGCAGCATTCCCCATC 

CCTACTGCAGCTCAGGCCAGCTGACTGCTTTATGAGTAAACGTTACCAGG 

GAACACTTTGCAGTCTTAACACACATGCCCACCTGTGACCACTGATCCCT 

GTTGGGTGACCACTGACATCAGAGATTCGATGGCAGCAATGAAGACAAGG 

CTATCCTCATTAGGAAGGAAAGGAAGGAGGAGGGAGGAGGGCAAACGAAT 

CTTTCCTGCTTGTCAACCACGTCCATCTCTGTTAGGTGATTTCCCATGTG 

TGACTTTGTTTATCTTTATAATAACTCTGAGAGGTAGGTCTTGATGTCCA 

CATTTTGAACATGAGGACATCCAGCCAGGAAGTTGAGTTCTGGGGACATA 

GCTGAGAGGGCAAAGCTACATATAAACCCCTCTTTGTTTTTTCTGGCTTA 

TCCACTGAGTGCCCCCTGCAATCCACCAGCCCATTTGTGAAGTGCATACT 

ATAGGTAAGTTGGCACAGGAGGAGTGGATGTGGGCGATTTTGTCACAGCT 

CTCCAGGAACTTACACACTGGTGAGGAGGGCCAGGTATGTTCCTGACCAG 

TCACAATCAAAGCAACCTCCTACTAATCAGGGAGGCTTGGTACCTGGGGA 

ATGCTATGTTGAAAGGTTCTTTTCTGGGTTTTAAAATGATGGGTCTATTT 

CCTTATTCTTAAGATTGCTTTTTTTCTGGCTAGAACTTAAAAGAAATTTT 

CAGTAAAATTTCCCTTCCCTGGCACAAAGTGAGCTTGAAATGAATTCCCA 

GGTGGCCTTGATACTTTAAAATATTGCCTCCTATAAAATCAACCTTTAGA 

AGAAGGAAGTCAAAGAACATGCTAGATTTCACAAAGGTTAATTCCTTGAA 

ATCCAGTTATCTACAGGACAATGTTGTCAAAGAAAAAATTATTTGGCCAG 

GCACGGCGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCAGG 

TGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAA 

ACCCCATCTCTACTAAAAATACAAAAAAAATTAGCCAGGTGTGGTGGTGG 

GCACCTGTAATCCCAGCTACACGGGAGGCTGAGGCAGGAGAATCGCTTGA 

ACCCGGGAGGAGGAAGTTGCAGTGAGCCAAGTTCAAGCCACTGCACCCCA 

GCCTGGGCAACAGAGCAAGACTTTGTCTCCAAAAAAAAAAAAAATTCAAT 

GATATTTTTAAATTCATGGTAAGGAAGATTTCATTCAGAACCAGCACAGA 

AGATATAGGAAACACTGCAATGGGACTTTGCGGTGGGGGAGAGAGATTGA 

ACACAACTACATATACAGCACGGGCAAGGACATATTCATAGCCAGGAAGC 

AGAGCAAAGATCAGTGGATGCGAAATTACTAAGAGGAAACATGAAAAATA 

AGGGAGCTTCTGCCTAAACCCACCTAACCGGATCCTTGCTGAAGACAGGA 

CAGGGTGATTGGACACCACTTTGGGGATGGTGGAGGATGGGGAATCCAGT 

GAGATTTCAAGGGTGATCAGATATTGAACATAGAAGGTTCTTGCTAAAAA 

AGGAGTTTACAAGAAAGTGTACAAATGTGCCTGGGAGAAGGTTCAGGAGC 

CTGACTAAAATTTGGTCAAGCAGAGAATATTTGCCAAGATAATAGCTAAG 

TCTTCTGACAAACAATAGATGCTAAGCCAGCAAGGGTGATGTGCTCAGAG 

AAAGCACTGAGGGCTTATTTCCTTTTCCCCCAATCTCCACTCAGTCAAGT 

CTAGTCCCCTTGTCAATGTAGCCATTTGTAAGAATGCAATCAGGCAGGGT 

CCCATCTCCTAGTGACAGGACTGACTGAAGTTC7GCTGAAGAGAGTGGCC 

TGGGGCTGACACCGAGATTTCAGAGTCCTGGGTTTCGCCGAGAGCTCAGT 
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GTAGTGCCATGCCCTCTLTCCACCTGAACGCCCAGTGTGGGCAGGAACAA 

CTGCAGCTAGAAGTCTGGCACTTACGCTGGGG7CTAAGACCTGCCTGATC 

TGCTAACTAGTCTTGTCCCTTGGCTATAAACTGACGTTGGCACCTGGCCA 

GAAAGATGAGCAAGAGATCT CTGACACACCTTTAAGTCC CTG7GGAGTAG 

GATTATGTTGGGGAAGGTCATTCTCTTGACTGAGCAGCAATTTCAGAAGG 

AAGTCCCATGCCGAAGTGAGAGAAGGCAGGGAATCCTGCCTAGTCAGCTA 

GAJSCAAAACAGTCTGCAGGACGGGACCCAGGGATGTGATCCTCCCATCCA 

AAGGCACTGAACTAAATGACTAAAATACTTTCCAGGGCTCACGTTCTTTG 

AAGAATGGGGACTAAAACTAAGACAGGAGCCAGCAAGTGAGGACTTGGAA 

GGAGATGGCTCATCTGATCAGCCTCCACTCAACAATTTTAATCATCCACA 

CTGGCATGGGGACACAATATGAATAAGTTGACAGGGACCTACTCTGATTA 

AGCAGTGGGC7AGTGCAGAGACCTGTCAGTCAAGAGTGGACAGGAGATGA 

7TTCAGACAGTGAGAACAAAATTAACAGAGTCATGTGCTAAAGGGTGGCT 

GGAACTACAGAGGAGTTTAAGACTCAAGAGGTCTGGCTGGGCGCGGTGGC 

TCATGCCTGTAATCCCAGCACT7TGGGAGGCCGAGGCGGGCGGATCACAA 

GGTGAGGAGATCAAGACCATCCTGGCTAACGCAGTGAAACCG€ATCTCTA 

CTAAAAATACAAAATATTAGCCAGGCGTGGTGGCGGGCACCTGTAGTCCC 

AGCTACTCGGGAGGCTGAGGCAAGAGAATGGCGTGAAC CCGGGAGGCAGA 

GCTTGCAGTGAGCCAAGATTGCGCCACTGCCCTCCAGCCTGGGCGACAGA 

GCGAGACT CCGT CTCAAAAAAAAAAAAAAGACTTGAGGGAGTTGTTTATT 

TTTGTTTTCTTTTTAAGACAGGGTCTTTGTTGGGCGCGGTAGCTCACGCC 

TGTAGTCCCAGCACTTTGGAAGGCTGAGGTGGAAAGATCTCTTGAGCCCA 

GGAGTTTGAGGCCACTCTGGGCAACATAGCAAGACACCGTCTCTACAAAA 

AATGTGCAGGT7GAGGCTGCAGTGAGCAGAAAAACACCGCTGCACTCTAG 

CCTGGATGACAGAGCGAGACCCTGTCTCGGAAAAAAAAAGAAAAAAGACA 

GGGTCTCGCTGTGTCACACAGGCTGGAATGCAATGGTGCAATCATGGTTC 

ACTACAGCCTGGAACTCCTGAGCTCAAGCAATTCTCCTACCTTGGCCTAC 

CAAAGTTCTAGGACTACAGGTGTGAGCCACCACACGTGGCCTCAGGAGAG 

ATCTTAATAATAAAAGGACAAATTGCCTTGCATCCCTTAGGGGCAGGATT 

GACACATCCAAGGATCAGGCAGAAAGCCTGTGCGGAGTGGGATGAGCAAA 

GAGAAAGGCTGAGAGTTGTGAAGAGGGAGATGCAGTGCCAGCTAGGACAG 

GCCTTTTTGGGCTATGGGAGGTTTTCAGAGGAGACCCCACCTAAACTAAC 

CCATAACATTGCAGTGGGGACCTGTTGAAGTCATGGACTACTACCTGAAA 

GCCAGAGAAATGGGAGGAGCCTTTCCTCTGAGGAGGGACTCTAGTCCATA 

GGTATCTTGCCACCAAATACATGGACAGGCCCTGGGGGAAGATGGTGGTA 

GCCCAGCTGGAGGAAAACCATTTGCCACCTGAACTAGCCCAGGGTAAGCC 

ACCCAGGCACTGAGGGTGCACACCCATGCATGCACACACAGAATCACACT 

CCTTCC7ATTATTCCTCAATTCAGGGGTCTCAACACCCATTTTTTTTGTT 

TTTGGGGTTTTTTTTACATGTTTACATTTTATTTATTTATTATTTTGTGA 

CAGGGTCCCACTCTGTTGCCCAGGCTGGAGCACAGTGCAGTCGTGCAATC 

ATATTAGATTGGTGCAAAAGTAATCACGGTTTTTGTCATTAAAAGTTTTG 

CCATTACTTTTAATGATAAAAACCACGATTACTTTTGACGCAACTTAAAA 

GCTCACTGCAGCCTCAAAATTCCTGGTCTCAGGGAATCCTCCTGCCTCAG 

CTTCCTGAATAGCTGGGACTACAGGCACATGCAATCCTACCTGGCTAATT 

TTTTAAAAA 1 T 1 lUUll^TAAAGATAGAAAGTCATTTTGTTGTCCAGGCT 

GGTTTCAAACTCTTGTCTTT G TGCCTCCCTCTGCCCTGTGCAAGACCTTC 

TGGATGCCCACTAATGAAGACTTCCAGGGAGAGGAAAAGTAAACATAGGT 

CCCTGATCAAGGGACCAGGGTTTATCGACCACAAACAGCATGCCCAGATT 

CCACTGGCAGTCCTAGAGGTCGCATTTGCCCCAAGTGTGTGTGGAAGGCC 

TCTCCCTAGCAGTTGGTTTATACACCAGCCACAGCACAGCATATTCTCTT 

AAATTGTGA ACATT TGCAAAAACTCCTTGAGGACAACTATCATGTCTTGT 

GTACTTTTGTTTTGTTTCCCTTCCCCTATGTACACGCGCGCGCATGCACT 

CATGCACGCACGCGCGCGCGCACACACACACACACACCCCTCAAACTGAA 

TGCCTGGTGTGCTGAATGGATGAATGGCTAATGTAAGTCATTCTAAAAGC 

TACTrTGTTTGGCATACCATCACCTTTGATTTCATCTTTCTGGAACTCCT 

ATGTTCCCAGATGAATTTGGAAAGCCCTCAGGAAACATTTCAAAATTGCT 

ATATGGGAGAAATGGGAGGGTCTCTCTAGAAATTTACCTGCCACAGGTAT 

TTCTGGTAAGACACAGCAAAGGTGGCACCACCCATTCCTCGTTACAATGT 

CAATGCCAGTCACCTTCCTGTCCCATAAAACTTTATTAAAGGTGCAGAAT 

rCCCATGGAAGCAGGTGGACACCATCTGCTTCCAGCCAGCCAGGGGAGCA 
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AGGTGTCCACTG7GCC1TTGTGGCAGGAACTGCGCTTCTCTACTCTCCCA 

C777GAGGCC7C7GGGGC7GGCC7GC7GCC7CC7CA77GACAAGGC7GC7 

7AC7GAGCAG77CA77C7GAGC7GGACA7AG7GC77C7GG7GAG7C7C7A 

C77C7A777AACC CAAAGATATTC7TTC CTAAGGAAAC GCTTTCCTGTCG 

GGGGAGGTTAGCTCCAGATGGAAGTCACAAGTGATGGCATGGTAGCTCTC 

ATCCGTTTGGGTGGATGATATTCACGGAGCACCACCATGAGCCAGTCATG 

GAGGTGAACAGTATATGCCAGCCCTGAATCAGGTGCATTGACAGCAAGGG 

AGACAAGCAAACAAAGCTGAGGTTTGCTGAGGATGTTCAAGACTCACACA 

GCACAGAGGAGCATCCACCACCCAGCTTGGGAAAGGACTTGTTATAGAGG 

GGGTGAAGCATGAGCTGAGTCTTGAAAGACTAGAAATTAGCCAAACTACA 

AGGAGGAGAAGGAG777CCAG7CAGGAAGAACAGG77A7GCAAAAGCACA 

GAGACTAGAAAGAATATCACATTCAAGGAACTGCAAATAGACAGGAAAGA 

TTGATGCGTGGGATAGGAGAGGAGGGCAGGGGATTCCAGGTGGGCCCTGC 

TTGCCACACTCAGGAGCTTGAACTTATCCACAAAGGAGGTGTGGAACCAG 

TAATGAATGGGTTTTGTGCAAGGGCTTCATGTCACCAGATTTGCTTTTTG 

GAGATACTTCTGTGGCTGATATGTGAGGAAGGGATGGAGGAAGTTTCCGT 

GGCAATCAGGAAAACCAATTAGCAGATGATTCAAATGGCCTAGGGGAAAA 

GGGAGGAGGAC77GGAC7ACCA7GCAGCAGCAGAAA7GGAGAGAAA7AAC 

AGATCCCAGGCACTCAGGAAGCGCTCAGAATGAGCCCTTCAAAGAACTTA 

TGG7AGG7GA7GGA7GGA7GGAG7G7GAG7CC7GGGA7AGCA77GCC7GG 

GAAAATACTTTCTAGTTGAGACAGGGAAGTGGGCCAGCAGAAATGGAGGG 

C77C77C77777GC777AAA7AC7777A7AA7A777GGAAC777GAAAA7 

GAGCAGA7A7A77AGCAAAAAGCC7AAAAGGGA7A77777GAAA7CAC7G 

C7AG77C7AACA7A7AAC777CAGC77GCACACA7CA7CAA77AAC777G 

ATAGCGCCTTTCTGAAACTATCATCCCAAATAGCAATCCTTGTAAAAACC 

TATTTTGAAAAACGGGCCTTGTAGGATAGCCTCACAGATGTTTTGTGGTA 

GATTTTCTAACATTCTAATGTCAGGGAGTGAAAGGAATCCCGTTAGAAG7 

7GGAAAA77C7GGAA7C7C7A77CA7GG7A77AAAGT777GCCG7CACAC 

AAAAG777AACACC777ACACAA7CAGAC77CC7CAT777ACA77GC7CG 

G7AAT7AGAGGAAA7CAG7CACCCAGAGCCTGGG7CCTAGACT7GACAAA 

A7GCACCCAACAAA7CC7GAG7GGCC77GC7GAGGAC77C7CCCAGAAGA 

7AGAAAAC7CAG77CCAGCCAACAAGGGGGAAGCAGC7GAAGAAG7GAAA 

77AACAAAG7CC7GGAAGGAAA7GACCAAA7CA7CTT7GAT7G7G7AA7A 

AC CAGAGAG7AGAA7ACAGC7ACGACAGACA7777GGGAGAGAAGCA777 

7A7CA7AGC7777AGAAGAGAA7A77777CAGCA7CA7AAGCACACAA77 

CCAAGACAGA7AC777CAAGGGA77G7Tr7GACG 

>Conuig53 

A7G77NNGG7777GGGACCCCA77CAAAC77CA7G77GAA7777AA7C77 

ZAA7G77GAGCGAGG7CC7G7GGGAGGG7GA77GGA7CA7GGGGG7GGGT 

7C7CCC77GC7G77C7CAA7GA7AG7GAG7GAG77C7CACAAGACC7GG7 

TA777GAAAG7G7G7AGCACC7C7CCCC77CA77C7C7CAC7CG7CAC7G 

C7CCGCCA7AG7AAGA7G7G7G7G777CCCCTTTGCC77CCGCCA7GA77 

G7AAG777CC7GAAGCC7CCCAGC7A7GC77CC7G7ACAGCC7G7AGAAC 

7G7GAA7CAG77AGACC7CT777C77CA7AAA77ACCCAG7C7CAGG7CA 

TTCT77A7AGCAG7G7GAGAGTGGA7GAATA7AG7GCCA7A7G777G7A7 

TCCCAGC7ACCCAGGAGGCTGAGG7AAGAGGA77GC77GAGCC7GGGAG7 

T7AAGGC7GCAG7GAGCCA7GAC7G7ACCAC7GC7C7CCAGCC7GGG7GA 

CAGCGAGACC77G7T7CCAAAAAAAAAAAACCCAAAC7G7G7AAAA7G7G 

77CA7AAAAG7G7CT7GC7CCCACACCTG7CCC7A7A7A7C77A77CC7C 

AGCC7CCGACAAC7AC777A7TCA777CTTATG7A7C7TCCAGAA7CAAA 

AAAAAAAAA7CAAA7ACAAGCACAG7GGAA7G7A77GCCC77C77CCCC7 

CCC 7777G7 7ACA7CAGAGT7AGCA7A7CA7AAA7ACGG7C7GCA7777C 

T7CTTTT7CAGC7A7CAGCA7G7777GGAGAGGA777CA7A77CG7GCAG 

ACAGCA7G7A77AG7CAG7CC77GCA77GC7A7AAGGAAA7ACC7GAGAC 

7GCA7AA777A7AAAGAAAAGAGG777AA77GGC7CACAGC77CGCAGGC 

TG77CCACAGGAAGCA7GGCAGCA7C7GC77C7GGGGAGGCC77AGGAAG 

C7777AC7CA7GCAGAAGACAAAGCGGGAG7GGA7G7C77A7A7GGCAGG 

AGCAGGAC7GAGAGAGAGAGAGAGAGAGAGAAAGGA7GCCACA7AC7777 

AAACAACCAGA7C77G7GGGAAC7C7G7CACGAGAACAGCACCAAAGGGA 

TAG7GC7AAACCA77CA7AAGAAC7CCACCCCCA7GA7CCAA7CACCCCA 
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CACCAGGCCCCACCTCCAACATCGGGGATTACAATTTGACATGAGATTTG 

GGCTGGGACACAGAACCAAACAATACCAGAGTGCTTTCTCATTCTTTTCT 

ATAGCTGCCTAGTATTCTATGTCCTTTACTTCATTTAGGCAGTCTCTTGT 

TGATAGACACTTGGGTTACTTCCAATTT7TCCTATTACAAATGATGTGCA 

ATGAATAATTTTGATCATTTTCCATTTCACATGGGTTATGTCCATCTGTG 

■GGATAAATCTCCAGGAGTGAAATTGCTGGATCAAAGGGGAAGTGCACTTG 

TGATTTT CAT AGTTAGCAAATTTTGTT CTATAAGGGTCATATCAATTTAT 

AGTCCCACGCGTAATATTTAACAGTGGGGATTTCCCGACAGTTTGACCAA 

CAAGGTCTGTTGTTAAACTTTTGATTTTTGTCAATCTGATGGGAAAATAC 

TAGTATCTCAAAGTGCTTTTAATTTGACTTTCTTATTACAATGTTAAGCA 

TCATTTTACTCTGCCCAAGATCAAATAGTATTTTCTTTTCTGTGAACAGA 

CTGTTAAGATCCCTTGCCTCTTGTTTTGCTGGATTTTTGTTCTTTTTTTT 

CAAATGTTTTGAGGCAGTTCTTTACATGTGAAACAAGTTATCTCTTTATC 

TGGGGTGTGAGTTACAACTACTTTTCCTCTGGCTTGTTTTGCGCTTTGAC 

TTTGCTTCTGGTGATTCCCGCAATTCTGAAAGTGTACTTTTTGCATCATT 

CATTCTTATACACCCATGCTCTTGTTCACGCTGGTTCCTCTACCTGAGGG 

CTTTTTCTTTTCTTTTCTATCTGGGAACATrrTTTAGAGACAGGGTCTCA 

CTCTGTCATCCACGCTGGAGTGCAATGGTGCGATCACAGCTCACTGCAGT 

CTTGAACTTCTGGGCTCAAGCAATCCTCCAGTGTCAGCTTCCCAAGTAGC 

TAGGACTACAGGTGCATGCCAGCATGCCTGGCTGATTGTTTTATTTATTT 

ATTTATTTTTTGTAGAGATGGGAGTCTCACTATGTTGCCCAGGCTGGTCT 

TGAACTCCTGGGCTCAAGCGATCTTTCTGCCCCTGCCACCCAAAGTGCTG 

GGATTACAGGCGTAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTG 

TTTATCCCTTTAGGCTTGATTCTTATGTCGTTCTCCTCCCTCCTTCCTGG 

CTACTCCTCTTGTTCTTTATCTTACTCTACTTGTCATGTTACCTTGTTTC 

TGCTTATAACTAGCTGCCTCTCCTATCTGAGGAGGGACTTGTGACTGTTC 

TCATCTCTGTACTCCCAGGTCCTAGTACATAGCGCTTGCTCAACAGATGT 

TTGGTGCATTGATAGATAAATCAATGGTAGCTGTTAATACCAGTCCTGAC 

TCCCTGCAGTGCTTCAGCTGATCCTGTTCCAGATGTGCACTGAATATCTT 

TCTGTTGAACAACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGG 

T GGCCA AGGATATTTGTAGGTACTTTGCAGCACTCAGCAATGAGGAGTGG 

GCTTTAGTCCCCCAAGAACTCTCACAGCCCTGTTTGTCTTTACTGTTCAG 

TGTCAAATCCAAGACAAGTCAATGATCAGGAAAGACCTTTTTTTTTCTTC 

AGTGAAGTTTATTTCAGAACCATTGAACAGTATGATATTTGCTCATTTAT 

AAATATTCCCATTTAAATAATCTGAGCTTATATATTTTCAGTCTTAATTA 

AAGGACTTGATTTAAAGAGAGCACACCAGTCCAAATTGAATTGATTCCAT 

AGCTATTAAAAACTAGGCTCTTTTACAGACACTGCTACTTCTTGCCCCCT 

TTGAATAAATTAGACCAATGAATAAAACAAACAAACAAATAAATAAATAA 

ATAGGGAAGCGGTTGCTCATCAGAATGTGGGAGCGAATGACAGAGGGTTT 

CTTAGAACCAAATGTGGCCGTGGTTTCTGTCAGGCGGGCTTTAAGTGAGT 

AGGAGAGGTGAGAGAGGCCTGGCTCAACAAAAGGGCTGGGGATTGGCCCT 

GAAAGGAGAGAGCTGACTGTCCTGGCTGATGGACAGGAGATCCTCTTAGC 

ACTACCCTAAGGCAGGCAGTTGGGCATTGGTGTAGACAACAGGAAAGTCC 

AGGCTATAGCCGTACTCAAAAACCTTTCTGTTCCCTTTCTGCCAGCCCTA 

GGGATTGAGTCCACATTCAGCACAGGACTCTCrGGGTACAGCTCTCTTTA 

GGAAGACACAAATTGCATGGTGAAGTCAGTTATATCCTGGCCGCCTTTGG 

TCCCTCCCAGGAAGACGGGCATGTTTTCTGCTTGAGAGGTGCTGATGTAC 

CAGTTGGGGAACTGGGCAGACTCAAATTCCAGCTTGTTATTGATTTCTAT 

CTTGTTGAAGACAAATCGCTTTTCCATCTTCTTCTTTGGGTAATTTTTGG 

GATCTACACTCTGCAGCGAAAGAGAAAGAAGAATTTTTGTGGGGCAAGGG 

ACAAAAATGCTATGGGAAAGATGTTCTTTGGGTTGGCCAGAAAGGAAACT 

GACGAGCAGGTCACATGATCAGGAGCCACACTCCTGAGTTGTAACTGGGC 

CCCCAACTTTCTGTGTGATTATTAAAAGAGCCCTTCTTCTTTTCTAAAAC 

TTAGTGCCAAATGCTGAGGAGCATAATGTAGGTGAGAATTTTTTTTTTTT 

GGGGGGGTGAAAAT7AAGCTAGAGCTTCTTGAAGTACCTAGTTTCCAGGG 

GCTTTTTATTGTATTTTTCCTTATGGTCCTAGAATGACATCAACTTGGAA 

ATGAAGCTTTTGCTGAGAAAGCTGGAGGTGATAGTGGTGGTGATTTTGGG 

AGTGGAGTGGACGTGATAATGGGACCCTTTAAGTCATCTATT7CCCAAGG 

TGTCTATCAAATGAGAGCAGCCCTAACAATATATAATCTGTTGGGGTTGT 

AACTATGGTAGGACATAATAACATCGGCAAAATGATTTAATTTTCTGCAG 
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ZAGGA77GAAGG77GCA^GCAG77AAAAn > rTATGTTAAATTTATTTXCAT 

7AATGCAAAA77GTCAAATAGACCTGTTCCCAGCT77TCC7AGGGATGGG 

GGCGGGGAGAAGG7GG7TG7C7GGGAA7AAG7GG7AGCAGGAGGC7GAGA 

AGGGCTTCATTCCATAGCATTCACTTACCTCCAGCTGTAGAGTGGGCTTA 

T CAT C777CAACACGCAGGA CAGG7ACAGA77C7777 CC77GAGGC CCAA 

GGCCACAGG7ATTTTGTCATTACTTTCTTCTCCTTGTACAAAGGACATGG 

AG&ACAC CAC7GAAGAAAGAAGGGGGTCTTGTGGTTAGGGACACAGCAGT 

G CAGGGT CACC CCAAC CC CT AGGCC CCATGAGTAGGATACATGTAATTTG 

GTAGCCTCTGTGGGAACCCACAGTGAGGTTCCTTGGCCTAAGACACAGGA 

TAAC77GAC77C7CACAGACAA7AGCAGGG7CA7777G77GA7T7AGGG7 

T7CCCC7CAAAGGCC7GAGGG777C7CAGAGCC7CA7AGCAG7AGGAACG 

GAGAA7GAAAGAGGG7C7ACA7777AAA7GC7GAAGGAAGGAAGGAAGGA 

AGCCA77G7G7CACTGGCTGGCAATGTGCCCA7CCACAGGAGCGGAACAA 

C77GA7CAATG7GGAAGGAAAGGAAAGAGG7GAGGC7G7AC77C7GCCAG 

AAATCAGGCACCAGAACTG7TTCAGGAACAGAGAG7AGCCCA7GGGAAGA 

AAC7GGGAGAGGAGAGGC7GAGC7GGGAAAGTGGC7CCAAAGAGAGACAC 

TCA7777GA7C77CCTCAG7CACAGCAG7G7CAA77GGAAGGCCC7GGGA 

TCAC7C77ACTACCCGA77CCAAAGAAACAGGA7777C77GGCC7GGC7G 

AGAGCAAA7AGC77CCCCC77GAGTGAGGC7G7CC77CAAAG7CAGCAGC 

C77AG7TGCCCACAC7CC7GTGCAGAGGC777GGC7AC7G7GGCACGA7G 

CCAGGCAGA7CACCACAGC7AA7GA7GGG77CACCGCAC77GAAAC7777 

GCCCG77ACAGCGGAGAGATA7AAG77CC7GC7GGGCGG7AAAAT77CCC 

TACAAGGAACCACC7GGCA77GGG7GGGACGGA7G77GGGGCAAGGGGGG 

AAGAC7GGGGAGGGGGATGGACACATTATCGCTCCAGCACTCTTGTTTCA 

G C CT CAA C AAC AGG AAGAGAGAAC C CACAGGCAG77 AGGC CATGTC CATC 

AAATGACCCCA7ATTGTGGAAGAA7TGACA7TGCAC7A7GCCCAAGAGAC 

TTGGG7GGACA7GGTCC7GGGAG7GC7TGAGCCGTCTAA777C7CAGGG7 

CACAC7CC7G77AACAAATGCACTGGCCAGTGC AA7CAAA7 G7GCCATTT 

CTAGGAC CAAAGT7TG7A7A77C C77T7TAATATTT77T77CACTTG7G7 

7GATCAT77GCC7TAAATTAAC7TTCTACTT7G7TTAAAACATGGAGAAT 

TAGCAAGC7GCCAGGAAGCCAGGCAGGGAAACCAGGATGTT7CCAT7TAC 

C7TG77GC7CCA7A7CCTGTCCC7GGAGG7GGAGAGC7T7CAGTTCATAT 

GGACCAGACA7CACCAAGCT7TTTTGC7G7GAG7CCCGGAGCGTGCAGTT 

CAG7GA7CG7ACAGG7GCATCGTGCACATAAGCC7CGTTA7CCCATGTG7 

CGAAGAAGATAGG7TCTGAAATGTGGAGCACA7GTTGTT7AGGTATAAAA 

7CAGAAGGGCAGGCC7CG7GAGGCAAGG7GGCAAAA777GA777C77GGA 

GGACACC7GAGCA7ATACGGTCAAAG7C7GA7GACAACACCAG7AGGGA7 

GAAGC7GGGAG7GGGG7GGCTAAGAACACTGGAC C7GACAC7A7TAGACA 

TGGG77CCAGC77CAGG7CTATTAC7GC7CACTG7GGCCGAGCAACAGAG 

C7AC77AGG7AAAATGGTGATGG7CATAACACTAGCCCACAGGGAGG7TA 

CGAACC7C7GGTGACAATGTAAGTGAAAGGCCCC7GAGAAAGAGTGAGGG 

AG77GCAAATG7CAG TAGC CA7CAAGATCT7CTTTA AGAA TAGTT7CCAC 

7AAAGAGA7GAT7GCTrTGGTTTCCAGCCTTC777GT7TTG7C7CCCCGC 

TGGGCC77C7ACC7T7AAAGGiGC7TTGGCTC7GGGGGAATTGAGT7GGCT 

GGGGC77GA7GACTTCCAAGAGGACACAAGTGGAGATCTAC7GCCTGCTC 

TTGGC7AAC7ACCTTCTTCAAAGA7GAAGGGAAAGAAGG7GC7CAGGTCA 

77C7CC7GGAAGG7CTGTGGGCAGGGAACCAGCA7CT7CCTCAGCT7GTC 

CA7GGCCACAACAACTGACGCGGCCTGCCTGAAGCCC7TGC7GTAGTGGT 

GG7CGGAGA77CG7AGCTGGATGCCGCCATCCAGAGGGCAGAGGTCCAGG 

7CC7GGAAGGAGCACTGCGGAGAGAGCGAGGGAGGGAGCCTGG7GAGGTG 

G7CC7GCCAGGAACCA7GCTTTGACATCAGAGAGTAGAAAGCTCAGAGAG 

GAGGAAAGGGC7TGAAAGAATCCCGAGC77C7AAAGA7CATCCC7CTCTG 

GGCCAGGCG7GG7GGCTCATGCCTGTAA7CCCAGCAC7TTGGGAAGCCGA 

GGTGGA7GAA7CA7TTAGGTCAGGACT7CAAAACCAGCC7GGCCAACATG 

GCGAAACCCCT7C7CTAC7AAAAATACAAAAAT7AGC7GGG7G7GG7GGG 

G7GCACC7G7AA7CC7AGC7A7TCAGGAGAC7GAGGAAGGAGAA7CGCTT 

GAAC7CAGGAGG7GGAGGATGCAGTAAGCCAAGA77GTACCACTGCACTC 

CAGCC7GGGCAACAGAGTGAGAC7C7G7C7CATAAAACAAAACAAAACAA 

AACAAAACAAAA7AAAA7AAAATAAAA7AAAAAGAT7A7CCC7C7C7GAA 

GC7CAAGGAGG77AAGGG7G7AC7CAAGGGCACACAGCAGG77AGAGGCA 
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GACTCAAGACTAGAATG i GGGCTTTCTCa^CACCTTACAGGCTATTCTTTT 

AGAATAAATCCCATTTC7ACTTTGTTCATCTTTTTTGTACATGCCCCACC 

TACACCATACATGTATACCTTCTCTATATCTTTTTGTATCCCTAATGCTG 

TCACACTATGATTTGCTTTTTCATGCAGATGACCATAACATTTTCCATTC 

AC C7A7GC7CAC7CAGCAAGTA7TCAA777TTC7ACAC7GTTC77777T7 

TCC77777CA7AACAC7G7C7CA7AGGCA77C7GCAAA7CC7G7GAGAG7 

ACJTTTTGTGAAATGTTACCACTTTCCTCTTATTCAGAGAAGCTCCGTAT 

TAAGGCTTCACTGAGGTTGCCTTAAGGCATGATAATGGTTCAAAGGCTTG 

AAAGACAGTTAAAGAGACCTGTAAGTGCACAAAAGAAAGTTGAGCAGGAG 

AGAATTTCTTGCCTGGAGCAGAGCCAAGCTACTGGAAGAGGCAATGGGGG 

CAAAGGCCAGGCAGACAAGCCAATGGGCTCCTCCCACAGCTGCAGCCAAC 

AAGTTATGCCAGTCTTAAAACTTCTAAAGAAATATGTTTTTAACAAGATT 

GAGGACTGGATTATGAGGCTAGGGGAGGCTATCACAAACTGGAATAAAAT 

AAAGCCAGAGAAAAGTGGCTGCCTTCCAACCTGCACAACTGACCTAGCTA 

GGCTGATGGCTGGGCCACCTAGGAAGGCTACTGAGCATCATATAAAACAG 

AAGGGACAGCAGGAATATAACATGGCTCTTTGTAAGGATGAGTCTGAAAA 

ATGACCATTTGCTGCCCAAATGCCCTTAGCTACAACTGAAAATATTTCAG 

AACTGGAGGTTGCAGGATGCTGGAATCTCAGAGATCATCCAGCTCAGCCC 

TTTATTTTTCAGATGA GGTC CAAAGCGGGTAAAATGACTTGTCAAGGTCA 

AACAGCAAGTGAATGGTTTTCTTTCAAGTCTCAATTCATCTTTTTGTTTA 

TATCATCTATGTCTTGTTGTTATAAGCTTCACCCCAGGTAGCAAAAAACT 

ATTCTAC7CAAAAGGGGTAGACATATGTTAGTTCTCAAGATCATCTCTTG 

GTtTCAGAGTTTAACTCAAGTGATTGGCATAGGCTGAATCCATCTCTTAA 

AAGGATAATCAAATTTATGTTGAAGACTTGGTTGTCTTCCTACTATGAAA 

TGGGAAACATTATCACTACTCCTCCCCTGTCACCACGAAGTGTGGCCACC 

ACCACCAACGTTAGTGAGTGACTGTGGTGATATGATGACCAAGTGGCCAG 

GTCAGCAAGTGGTGCAGCCTGTGTCTCACTGGAAGAGGTTAAAGTCTTTC 

TAAAACAAAATACCATGGCATCAAAGTGGCCCAGAACTCCCTTCTTTGAG 

CTTTCCCTGTGTTAGAGCCCTTCCTTGGGTTGGGAGTTAAACCCATAGTC 

TTACCTTCATCTGTTTAGGGCCATCAGCTTCAAAGAACAAGTCATCCTCA 

TTGCCACTGTAATAAAAACAGGGACATGTCTCAATTATGTCTTCTAAACA 

GGTTTATTTTTCCTTCCCTGTGTACAAGACTTGACTGTTCATAAGAAACT 

GCAAACAGCCTGCCTCTCAAAGCTGCCTGAAACACCTGGCAAGTTTCACA 

GTGATATGCGCAGAACAGTCCAGAAGGCAGATTCTAGGCCTGGCAGGTGG 

GCACCCTGGGTGCTCCCTGTTGGATCTTGAGGCCTAACCTCTAGCCCAGC 

AGAGTCAGCTAAAATCTGAGCTCTCCCTCTCCCTCCAAGCCACACTTTGC 

AAAGGGATTCCTTG7ATTGTGGGCTTGGAATCTTTTCTCCCCATTTGCCT 

CTGCAGGAAGCCCTTGCAACAACACATCTGGATAGCCTCCAGG7CCCAAG 

3CTGGAGGGACTTGTAATGGGAAAGTAGTCTTTAAATCAGAT7TACTTGG 

CACC CTGTTTGCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCC 

AAGCACAGATAACACTCTACTCTTGAAAGAGGAGACCTGCTCATGTTACT 

GGTCTCAGCG7CTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAG 

C7CAGG7AC77C7GCCATGGCTGCTTCAGACACCTG7GTAAAAAGGAGAA 

AATGAG7GAC77CCCCATGACGGCTACGTTCATG7G7GATTTCTCTCAGC 

ATCCAG7GCA7GGCAGTCATGCAAAGAAATGATC7C7GAG7AAA7GAATG 

AA7G7G7GAAAGAGAAG7CCTTTGGGTCTAGAGAAAAGCA7T7GC7AAAC 

CAAACCCCAAC7AGCAA7GTATTGGCTAGGAGAGC7GGAGCAGAGGCT77 

GACAC7AACC77TAGGG7GTCAGCTGTTAGATAAGCAGTA7CCA7TCCCA 

GAA7A777CCCGAGTCATAAGCATTATATTACACC7GGCATT7TTGCAAA 

AAGC7GAGAGAGGGAGGCAGAGAGGGAAGGAGAGGGAGAGAGAGAGAAAG 

AAAGAGAGAGAGAGAGAGAA7ATGCA7ACACACAAAGAGGCAGAGAGACA 

GAGAGAC7CCC7TAGCACCTAGTTGTAAGGAAGA7TAAAG7CA7ACT7GA 

GCAA7GAAGA77GGCTGAAGAGAATCCCAGAGCAGCC7G77GTGCC7TGT 

GCC7CGAAGAGGTT7GG7ATCTGCCAGTTTC7CCC7CGC7G77TT7A7AG 

C7T7CAAAAGCAGAAG7AGGAGGCTGAGAAA7T7C7C7G77GAATACC7G 

A777CACAA7CAAG77AAAGGAAAGGGGAAAAGAG7A7TGG7GGAAGCTT 

C77AGGGGAGGGGAC7AATAAAC7GAGA7AA77C7C7GG77CA7GGAAGG 

GCAAGGAG7AGCAAAC7A7GACACA7T77GCAAA7G7ATCACCA7GCAAA 

7A7GCA77G7777CC7GACAA7CG77G7GCAG77GA7G7CCACA77AAAA 

7AC7GGA7777CCCACG77AGAAGAA7GTT7AAA77TAG7A7A7G7GGGA 
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CAAAG7GGAAGACACACAGA777A7ACXaGCACA7AC7777C77CA77CA 

C77C777G7AC77AAG777AGGAA7C77CCCAC77ACAGA7GGA7AAA7G 

GG7ACAA7GAAGGGCCAA7AGCCC7CCC7G7C7G7A77GAGGG7G7GGG7 

C7C7ACC77GGG7GC7G77C7C7GCC7CGGGAGC7C7C7G7CAA77GCAG 

GAGCC7C7GAGGAGAAAA77GACC777C77GGC7GGGGCAGAGAACA7AC 

GG7A7GCAGGG77CAGGC7CC7GACGGAG77GGGGCAACCC7GGAGA7AA 

GCTCACACAACCCTGCAAGACCAGGTGCTGTTACCCTAGCCAATCTCA7G 

GATGAACCAGATCAATGCCAGATGAGCTCTGCCTAAAATGATTTTTTGGT 

GAACTCTGAAAAGTGGAATATTGTTTCTGTAAGAATATCCATCTGAGACT 

CTATCTC7TGGTAATACCAAGAGTTATCAGTTTCTCTTTAACCGAGACAC 

CAGCAAAG7GCCTGCTCCAGGGTACTGCCCAGGGGAGCCCTCCATTTGTA 

GAA7GAA7GAGAG7CCAGG77A7GAACAG7GCC7GGAG7G7AGGAACACC 

C7CC777GCC7C777GACAGG7C7GCA7CA7AACAC77777777777777 

TGAGACAGAG7C7CAC7C7G7CGCCCAGGC7GGAG7GCAG7GGCACGA7C 

TCGGCCCCC7GCAAG77CCGCC7CCCGGG77CACACCA77C7CC7GCC7C 

AGCC7CCCCAGCAGCTGGGAC7ACAGGCACC7GCCGCCACGGCCGGC7AA 

777777G7A77777AG7AGAGACAGGG777CACCA7G77AGCCAGGA7GG 

7C7CGA7C7CC7GACC77G7GA7C7GCCCGCC7CGGCC7CCCAAAG7G77 

GGGA77ACAGGCG7GAGCCACCG7G7CCAGCC7G7AACAC77C77A7AGC 

AC7GAG77GAAACC7TGC7CC7CC7GG77CC7CCAGGAAAC7GAAA7C77 

T77GAGCCAAG7C7AGCACAG7GCC7GGCA7G7ACA77CAGG7GG7AGAG 

r77GC7GC77GAA7GGG7GAA7GGGAA777GACAGCA77777A77CAAA7 

7AG7A7G7GCCAGG7A7CG7GC7CGC7C7GCA77A7CCAAGGGAG7GAGC 

C7C7G7GCAAG7A777GAGACACGAGGGAAA7AGG77C7AC7G7GGGAAA 

AAGAGCA777CA7GGAC77GC7C7CCAAGCAGCC77C7GA77777AA777 

GGC7CCCAG7A7C7TGA7A7CAGGAG7CAG7CACAAGAAC7CCA7C77TA 

G7AAG77A7A7777CCACAGGAAA7C7AAAAGC7G77CAACA7G77AG77 

7CC7G7GAA7T7GA7AAGCCA7AA7CCA77CC7AACAC7GAGCCC7CC7G 

AAA777GG7G7C7GGTCC7GCAGA7AGC7AAAAGCCC7G7CTGGG7GGCC 

7AGGGGAC7CC7CTGT7T7GCC7CCACAGGATCCAC7TTGCAAA7TAACC 

ACTGGTrC7CCCGTTGTAGGAACTG C^CCTT CCTCAGAGCC7G7C7T7C 

77CC77CC77CC77CC77CC7C777C77777C777C7C7C7C7C777C77 
7CTTITCTTT7CTTrCTTrCTT7CTT7CTITCT77CT 

7C777C777C77CC777C777C7C777C7C7C777C7C7C777C7C777C 

7TTCTTTCTCTCTCCCTCCCTCCCTCTCTCTCTTTCTTTCTTTTTCTTTC 

77TTC7C7777C777C7C7C777C777C7CCC7CCC7C7C7C7C777TTC 

7TTG7C7C7CCC7CCCTTCTCTCTC7CTTTCTCTTTCTCTC7CTCTCTC7 

CC7AGACAGGA7C7ACC777A7CCCCCAGGC7GGAG7GCAG7GG7ACAA7 

CA7GCA77CA77GCA7GA7CACAGCAGCCTCAAACCC77CC7CAGAG7C7 

77A7GCGGCAACCAGCAGGG7C7GGAGGGTTGG7GGC7C7G7GAAC7C7C 

C7GACAGAACACAGAGA7G7C77TGGTCTGTTGATG7GAT7ACAAGC7GA 

ACGAAGGAGGA7CAAAGCCAGTGACAGGAAGGGAGA7ATGCAAGGGACCC 

GAGCA7CAGC7C7GAG77AG7CCA77C7GC77C7GGGAC77GGGA7ACAG 

G7CAGAAACCTTGAGCT7CTACTTC7CCA7CT7CCAA7TGTAGCATCCAG 

GACC7CAGAA7C7GCCAGC7AAGAGGAGCCG7AA7GA77G7C7GG7GGGA 

7A7GG7GGGACCACAGAGA7GAAGACA7GAA7AGC7A777GAA7G7GAAC 

AGCAGACGAAGAAATCAAGGCTAGGAGGGTGGAAG7GACTCATCCAATAG 

CACAG7G7GGT7GAAGCAGCAC7AG7A7CCAGG7TGCATGAGCCCC7GA7 

GCT77CGC7CGAGGGAAATT7TGGAGCCATGGGGCAA7GCCCCCTGACGT 

AACAG7C7CCACAGTTCTGCCA7G7C7CATCC7GGCCCTGTAACCTGGAC 

CCAAA7C7GCTACCA7CCCA7CCA7C7CAGGAAGTGAAACCTC7TA7G7C 

AAA7AGG77G7GCAACGTA7G7A7CAGA7CCTG7C7TCCCAAGGAGACCG 

C7CAGGCCACAGCACTTCCT7CCGA7CCCCAA7GAGCAGAAAATATCTCG 

C7A7AAACA7AGT7GGCAC7AAGGGAGGGAG7GGAAGAGTGA7GA7GA7G 

7AGA7GG7GA7G7AGCCCCAAGGAAG7GGAACAAGCAGAGA7GGGGAGC7 

•GGAAA7GCCAGGA7GC7CCAGCT777GGGGAA77A7TCAGC7C77GAG7C 

AC7AAAGCC777C7CAGC7GCAAG77CC7CTT7ACCC7GTO^GG7CAT7C 

7TCCAAGACAGGAGACTGACA777A77CAAAGCAGCAAG7GCCC7GATAC 

CA7C77G7G7C7AA7CA7GGGC77CGCAGCCAG77A7CAAGG77GA7C7C 

A7C7CA77GG7C77CAA7CA7777GAACAAGAAGACAAGCAAAA7AA7CA 
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TGGG77AG77C77A7A77A77G7G7G7ACA7GCAG7GA7G7C7G77C777 

G7AG7GAGC7377CC77CC7TG77CACCC7C77GC77AGAACAGAAC7AA 

3CAATCTGCCCCCAACATTTTCCCCAATTTCCCATCTCATTCTTGGCACT 

GGC77CC7AA7A777G77C77A7GAG7CA7777C77G7A7CA777CCA7G 

AG7CC C7 C7GGGA7C77AAAG7A7GAAAAA7G77G7G7G7AC CCACAC CT 

G7C777G7GGA7A777C7CTCC777CCC77C7GC77C7GGGA77A777GG 

GAAXGGGCAC7A7GA77777A7CA7A7CGC77CCAC77CC777A7GGCA7 

CA7C7CCAA7GGGCT7C77C7CCC7C77GGA7CCAGG77C7CAGA77GGG 

GACATGCAGAG7CCAAGGAACATTCCATTCTCCTCCCTGGTCTAGAACAA 

GGAGGGC77AGA7A7A7GAGCAGG7GGC7GGGGC7GGCGAGC7A7G7AG7 

C7CCAA7GGC7777CCC7GA7G7CGGAG77G7TA7G7CAG77C7GGGAGA 

CCAA7AAGACC77G7CC77CC777GGA7CCA7CAGAAAAAGCCCC7GGG7 

GGG7AAGA7GGA7GGCAGGGC7C7CC7AC7C7A7G7C7T77C7CACACC7 

AG7GGG7A7AAGAGAGGGGACCACAAACAGAGGGGGC7C7GG7ACCAC77 

A7CCAGGG7C7GGAAACA7T77C7G7AAAGGGCCAGA7AA7AAA7G777C 

AGG7ACAAC7AC7CAACC77GCA7CA777CAGAAAAGCAG7CftGA7AA7A 

CA7AAA7GAA7GGG7G7GGCTGGAC77G7CC7GCGG7CCCC7G7C77A7A 

TCA77G7A77A7A7CAT777T7C77ACA7ACAAA777AGAAGCAA7AC77 

AAAAAAAAAAAGCCG7CC777AT7GAGCACC7ACTAAG7GCCAGG7ACC7 

T77777CCC7CArrA7C77AT7AAC7C77CA7AA7AACC777AAAG7AGA 

TAA7A77GAACCA777GACC7A7GCAGAAAC7GAGG77GAGACAA7AAA7 

TA777AAGACCGCACAAACAG7AAA7GC7GGAAC7ACGAC7CAAA7A7GG 

G77AACTGAACCAAAACCAGA7C777A777C7CAC777TAA77G77AGA7 

A7G777A77GCC7CA7C7CC7G7CCACA7GG7GCCCA7CGGCAGAC7CC7 

T7C7CA77C7CAG7GAT7GAG7GACA77C7AAAC7ACAT7GGCC7GGCAG 

A77CACC7C7G7CCCC7AAA7G777CCACA77G7CC77T7AGGA77GAGA 

7CC7C7C7G77CCCT7G7CTTCCC7CC777C77C77C7GGCGG7GACG7G 

C7G7G7GAA777GTT7C7T7C7CC7C7CAGGG7AG7AC7GGGAC777CCA 

AA7CAGGG77777AA7GATC7C7Crr7CNCITITC7GAA77TC77CC77AT 

TCCCA77CAC777C7CA7C7A7AAG7GGCANC777G77GC7GGAAGA7A7 

CCC77G7GCAGGGAT7NC7C7T7AANAA777G7CNNNACC 

>ContigS4 

G7GA7CG7CAACCTCCCACCC7G7AGGGCC7CAAGCA7TGAGGACAA7CA 
C7GGC7GCCCA77AACCCAGAAA7G77GCCGAGACAGGAGGCCG7GGCCC 
AAG77CC7GGAA7GGGGTATTAT7A7G7CAGCACAAAGGCC777GCACAA 
A7GAAGGC777AAAAA7GCAG7CC7AG7CAGG7GGAGGAGGGC77A7AGG 
AT7CC CAGGAA7 C7GGA7CA77C7 C77GAGAGC777 CC C77G7 C7C7G77 
AAAAC7CACA7CGTACGGCCCAAA7AACAACAAAAAA7GGATG7AAAT7C 
77GAAA7AAC77G7GGATGGGGGAACAAGGCC CACC CCC CAGA7C7GCCA 
GAAGC77CAGG7GAGGGTCCCAAATGCCAAAAAG7C7GG7ATCAGAGAGG 
A7GGCCAG7GACNTGGGGACACATGCCC7TTGCTG7GTCAC7CAAGGAGC 
AGCAGC77CGGCCCCGCACAGTGACCAGGACCCTGGCT7CCCACGCTGGG 
CAGGAGC7GG7G7C7GATGAAGGGAA7GCC7GGCAGCACGTGC7GTC7GT 
C7CC7CG7G7CAGC77ACC7GGCTT7GC7GCGAAGAGGCCAC77GCA777 
C 77T A T 1 1TTT A7 A1V1 TITT A A ^ 

T7777A777A777A777A7T777AA7777TIT77AA7T7777AAA77A7G 
C777AAG777TAGGGTACATGTGCACAT7G7GCAGG7TAGT7ACATACGC 
A7ACA7GCGCCA7GC7GGTGCGCTGCACCCAC7AAC7CG7CA7C7AGCAT 
7AGGTA7A7CTCCCAGGTTAATCCC7CCCCCCCTCCCCCCACCCCACAAC 
AG7CCCCAGAA7G7GA7GT7CCCC77CC7G7G7CCA7G7GA7C7CAT7GA 
A7T7C777AAAGG7GGAATCTCTCAGTGGGG7C7AATC7GTTCAGAAATA 
7CAAAAGAG7A7CC7TGGGAATGACTGGAATTCCAGAG7CA7C7GG7AAT 
CC7CA7AAAACAACTCC7GGATGTC7C7CAGCACA7C7CCCACC7TGAAC 
GCAGGAGGC7GGTTCAAA7GGAGGAGCA7CGC7C7AC7GCACT77T7T7T 
777TT7GGCC7AAAGTGCAAAAGGGGATACG777CA7G7AAA7AAA7CAA 
C7GCAAA7CGC7AG7TA7GCTGAGCCC7G7CCCG7GCTG7GGACACAAAG 
GAACCAAAGGC7TTTC7CCCCGCCCAACACACACA7AACACACACACAAA 
A7CA7AAAAACA7ACA7ACCCCCAACACA7AACAACACACAACACACACA 
CAAAA7A7A7ACACACAACACACACCAAACA7GCCCACAAACC7G7G7CC 
AAAAA7AAA7CC7ACTGG7GGG7T7G7GG7C7CCC7AAC77CAAAAATGA 
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AGCCGTGGACCTTCGCAbTGAGTGTTACAGCTCTTAAAGATGGCATGGAT 

CCAAAGAGTGAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAAG 

CTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGCC 

AGCTTTTACTTCCTTTTSGCCCC7CCCATGTTC7GTTTCCATCCTATCAG 

AGTGCCCTTTTTTCAATCCTCCCTGTGATTGGCTACTTTTAGAATCCTGC 

TGATTGGTGCATTTTACAGAGTGCTGATTGGTGCGTTTTACAATCCCCTT 

GT-AAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAGA 

CAGAAAAGTTCCCCAAGTCCCCACTGGACCCAGGAAGTCCACCTGGCCTC 

ACCTTTCAACTCCATAATGGCATGAAAATACATATGTTGTACAAAACATA 

CATACACAAAGTATACATGCATCTCCCCAAATATACACATACCACAGAAA 

CATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGCC 

T CTGCAGTGAGT AGTTAGAAAAGTGAATTTGTTTTTCAATAAATTGGAGT 

C CTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAAA 

ATATGTATGTCCTTTGGTCTAGCATTTACACATGTAGGAATTTATCCTAG 

TGGAGTAATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCACA 

GAATTATGTATGCATATGTGTGTGTATATATATATATATCTGATACATAT 

AATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGTT 

AGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCtTATTCT 

CTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGGT 

ACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGAG 

TTTATGAATACTCCATACTACACTAGGTAGCACCCCCTATTAAAGACAAA 

CTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTCT 

ACAAGTCrCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCCA 

TCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCTC 

ATTCATTCCAGAAGTGAGCACCACAGAAGTGCCTACAGTTACCCCAACCA 

CCTTCTTAGAAGATAAGTTAGTGTTTGTTTrGACTTTTTAAAATTTTTAC 

TTCCTCTTTTCCTTCACAATCTCATCCCATCCCAAGAGGTTTATCAAGAA 

GTTCTCTAAAGATATGTGTCTCCTTATGGAATTTAACAGAAATCAGGGAT 

TTGTATTCTAGCCATCAAGGGAATAACATTTTTCCAGGTCTTTAGACAAA 

TAATGGAATACCTTGCAGTAATTAGATACACTATTGTAGAAAAGTATTGA 

TGAAATGGAACGATGTTTGAGATATCATATTGAGTAGAAAAGGCAAGATA 

CATTAAGTAGGAAATGTATCTTACAAAATAATTTGTCAGACACACTCCTA 

TATTTGTATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAGA 

CCACAGTCTTCGGTGAAGTTTAAGAGATGAGGCTGCAGCATGCTCAGAAA 

GGCCTGGGTTATAGTTCTTCCAGTAATTAAGGATGTGATCTTGGGTAAAT 

TGTCCATCCTCTCTAAACTGCACCACCTTTTGTCTGTAAAACAGGAAGGA 

TGG7ATTTACCCCCAGGGTCATCAAAGGATTTGGTTGGAGAAAAATAAAT 

AAATGGGCTGAGC CCAGAC CTGGCACAGTGAGAGCACAGTGGTTGACTAT 

TGTGCTGGCC7GTTGTTCCTGTGTTATTGACATGC7GCTGGTGGTGGTCC 

AGAAGCTATTACCTTAATTGGTTATGTGGATTTCCCCTCATACTGAGCAG 

CTGTGTGTGGTGTTGTAAAACATAGCCATACACAGTAACTGACAAGGGCA 

AATGTGATGGAAAAATGCAAGGAAGTGCAGATAAATAGCTAATGGGCTGT 

AGAAGGAAGCTAGTCCTTGGAGGGCTTGATCAAGGAAGGTCCTTTTGCAT 

GTCACCTTTGAAGAAGAGGGGACATAGAAGAGGTATAGTGCATCCCGGAG 

TGTACCTGGAAGGGAACATGAAAAGAGGACATTTTTCTCTGGGACATGGG 

GACTCCACTTGCATGAACTCTGGAATTGGGGCAAAGAACCATCATGAGAA 

CAAGGGCTTCCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTG 

TCCCCTCTTTCCTTCACTCTCCTCTGTTTTCTATACCTGTATTATTGGAC 

TGGACTGGAAGCCACCTGATCTATCACAAGTACCTTGAAATGTGTTGAAT 

AGGTGTGGCACAGTCCTTAGCAGAGTGGCACTACCCCCACAGGAATTTGT 

TTATACCTTTGGCATGGAAAATAGCAGGAAATGAGTGATCACTGATAACT 

GAGGATGCTATTTATTATTGGCCAAAGGAATACTTGTGTTGTATTTGCAT 

AACCACTCACAAACTGTTGATTACAAATGAGTACCAGACCTAGCTCCTTC 

AAGTAAAGGATCCTGAGAACTGAAGGCAAACAGAGCTCCAGGAGTCCAAG 

ACAGAGCCACAGACCACGAGGATCCCTGGCCCAGGTAGGTGGTCCTCCTG 

CACTGGCTTTCAAGGCCAACAGGATGGATGGGGAAGTAGAGTAGCATCTG 

GCCATCTAGACCCTTGCTT T TTATCCCCACTGGAAGCACATCTGAATTTC 

TAAATATGATCTCTGAGACCTGCCCAGAACACCTTGCTCTCAGCCCCAGT 

AGCAGCCTGCTCTCTCCCAGGAGGGCTTCCACTAACAAGTAGGGCATTGC 

7 GGAGGGC CAGGCAGACACTAGCTTAGGAAATCCACCAACCCTGGAAATG 
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C7AGTC C C77 C7 C7GAAGGC7CAGAAGAC7GAC777AGAG7 C7AGAAAAT 

A77GG7CC77GGGAACAGA7777GAG7GCAAAGAGA7GGAC77CAGA7GG 

C-CAGATGCACTGCTTCTTTAGGGAATTCTGTGAAAGCTCCCTGCATT7AT 

C77AA7ACAGGCAGCAGA777CA7GAG7 AC CC C CGAGGG A7GGCC C CAGG 

TCCTCCAGCCTGTGAGCATCCTTCTGTCCTTCAGCAGCACCACAGTATCT 

77A7A7G7C777GGA7ACC7ACG7T7C7GCCAGACA7C7C77GC7C7GA7 

GXTCTGGCTGCCAAATTCTCTGTCAAGCGCCTCCAATTTTTTGTGTCCTT 

TGATTTACCCCAACATGACAAAGGCAGTTGTGCTTCATGTATTCAGGGAT 

ACTGCCAAACCACAAACAGGTTAAAATCAAATAGCAGATATCCCTGTTCC 

TAAAGACCCATCAGCTCTACCCACCTGCTCCTGCTCACCGTCCTTATTGT 

TGAGTCCTGAAGCCC7TCCTTGTCATTTTTATTTTTTGCATGAACAATTT 

AGTTCCCTTTGTCTCACTCCTAAACCTTTCTCAAAGGATTGGATTTGTAC 

ACAAACTGCCTATCTCTGCAATCTTAGAAGTGATATGATTCTGAACAAAT 

CACTTAACTTTTGATTTTTTATTGGTAAGATGGGAATACCAATTTTTGCT 

CCAC77C7G7CC7A7G77GGCC7GGGC7GA7G77GAAAGC7C7CGG7CAA 

CTGAGATAGGGTGTGCAGAATTTATATATATAAATATATCT€CTCCAACC 

CCTCCCAATGAAGCAAGTCACGTGAGTCAATCCTACCCTAAGATATTAGG 

GATTGAGCCTCCTGGGACATTTGGTGGCTTAGGTTTTCATGAAAAGAGGT 

TGCAGAGCAACTGCTTTT T GTTAGGCAAAGATTAGGCTACTGCAGAGACT 

CAGCAAACTTCTATAGAAGGTGTCAGATGGTAAGTATTTTAGGCTTTGCT 

TGCCAGATGATCTCTCAACTAGTTAACCATGCTATTGTAGCCTCGAAGCA 

GCCAGAGACAATATGTAAACAAGAGCATGGCTGTGTTTCAATAAAACTTT 

ATTTAAAAAAACAGTCAGGGACCGGATTTGGCCAAAGGCCATAGTGTGCC 

AGCCCCAAGACTAGAGCAATGCACTTTTAACTTTTTTATTTTATTTTTGT 

AAAATGCCAAGATCCACAAAAATGCTATTGCACCCCGTGTGTTAGCACTG 

TGACTCAAGGTTTGGGAAATTCTGCTTTGAAGGCGTGATAGACAGGAGAG 

CATGG7CTGGCCCCTTGGTGCCTTTCTGGTTGCAGCGAGCATTTCAAACT 

ACAGAGCAAGGCCAGTGGTCTGTTCAGCACTAGAGACATGCAGCAAGGTG 

TCCTGGGGTGAGAAGATGCCATAACTGGTCCCCTTTCTATCTCCTTAGGT 

CTTGGACTTCATTCCATTTTCTGTTGAGTAATAAACTGAACGTTGAAAAT 

GTCCTTTGTGGGGGAGAACTCAGGAGTGAAAATGGGCTCTGAGGACTGGG 

AAAAAGATGAACCCCAGTGCTGCTTAGAAGGTAAGGTTCTTGTAGAAATC 

TACCTCAGGGCCAAAGTGTAATTCCTAGAGCAGAACTTTGCTAG GTGCT G 

TGCACAGACCCAGTTGTTTCCTGCTGACTTGCACAGTAAGTGAGCTTTCA 

AATTTCCCTGGACAAATAACTAGACAAGAGAAATTCTGGAAGAGAAAAGG 

AAGCTTTGCTTCAGTGTCCAGGCACATCAGGTAGTAGATAAAAGGATCGT 

CCTCACCTACAGATTTGGGGCTTTAGCATCCTGTTTGCCAACTGGATGGT 

TGCA7A7GC77CAAAA7GCACC7C77CCC7CCCAACAT7CCCAAG7GGAA 

3AGAAGCC7CCGA7GAGAAGGAAC7C7C7AAGGC7GGGC7GAACAAATGA 

CCCAGGCACAGGGCATCTGAGTATTCCATGAGGAACACATTTGGGTGTTG 

CCCATGGGGGACAATAGGAGGAGGCTTTTGACCCAAATGATTGTCTACTG 

AGGTGTGACGGGAGAGGCCTGTGACATGCCAGAGGCCAAACCCGTGATCC 

AGTTCATCTCTATTCTATGTTTCTGAAGAGGGAAGCTATGATTTAATGTC 

ATTACTATCATGCTGCTCTAGTATTTCTCAGCACATACACAGAAGAGGGA 

ATTAAATGGTCCTTGATACCCCTAAATCCTTGGAAAATCCGAATTGCATA 

TGCTAAC CT CACTGCGTCTGACTGCAGAC CCGGCTGTAAGCC C C CTGGAA 

CCAGGCCCAAGCCTCCCCGCCATGAATTTTGTTCACACAAGTAAGGCCTC 

GGGGTGAGGTGATGGGGGTGGCTGAGGTGCGAGGGTGGGGATGGGGGATG 

GAGC CATTGGGTCCTCTTACAGGGTGAGAGAATTGTAGAATGGGGACACC 

TAAGGGTGCTGGATGGGGCTGAAGTCTTTCCTTTGTGGAAGCAAATCCCA 

TTAGGAGATAACTCTGGGAAAGATGAGCCCGGGGAGGGGCAGGTGATGCT 

CACCTGCTAAGAGGCAAAGGGCAAGGAAGAGTTTGTGCCTGGGAACCTTC 

CAGGTGCCTCTTCTGACCATAGCCAAGAGACTGGAGACACAGACCTCCTC 

CCAGCACTGAGGACAAACAGCCATGGGGCCAGTGGGGGTGCAGGGACACC 

CACACCACTAAGGGCTCAGGGCGGCGCCTTCAGAGCCTGAACCTTCCTCT 

CATGCTGCCATTTGAACACCACAACACCCTAATAGGAAACTGTTAACATT 

GCCACTGTTCAGGTGTGGAAACCGAGACAGACAGTGGAGATTCCCTGCCC 

TAGGTGACACAGGTAATAAGTGACAGATGTGGAAATTTAAAGGTACTATA 

ACG7C7C-7C7GCC7GAC7CAGGC77AAGGC7CCCA7CACC7CC7CT7C7C 

AGGACAGAGTCAGGAGGCCTCAGCCTGAGCCCCAGCTCTAGTGCAGGTTC 
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ATGTGGGAATACTGAGCurCACTAGTALrtATGGCAGAGAGGACCAAATGG 

GACCAGG7G7G7AAGGG7GCC7GGCACAG77GGGGGAGG C7GC 7G7CGC7 

TCTCCACCGCTGCTGCTGCAGTTACCTTTGATGTTTTAGTTTTGTTGTAG 

77ACACCAT7GC7GGC777GGA7C7GCAC7G7G7CCAC7CCAGG7GGAAC 

CACGCACACAAGCC7C7C7G7CGGGCC7G7CC7GAC77C7CC77G7CAGG 

GCTGGGATC7CCTTCAAATCTGGCGGAAGTGGTTCTCCAAGTCTGGTCCT 

CAAACGTCAGCAGCATCAGCGCCTAGAAGTGT7AGGAATACACATTCCCA 

GGCCCCACCACAGACCTCCTGCCTCAGAAACTCAGGGCGCTGAGGCTCTA 

GGGGC7GC777AACAAGCC77CCAGG77A7CG7GACGCACC77GAAAG7C 

7GAGAGC7AC7GCCC7ACAGAAAG77AC7AG7GCCC7AAAGC7GGCGC7G 

GCAC7GA7G77AC7GC7GC7G77GGAG7ACAAC77CCC7A7AGAAAACAA 

C7GCCAGCACC77AAGACCAC7CACACC77CAGAG7GGCCT7GAGAAAGA 

7T7GGGG7CAAGGA7CATGAGCGAGAACACCAC77AAGAGGA7AG7GAAC 

TAG7C7GCA7G7GAGACGC7GAGA7CC7ATG7CAGGC7G7GA7AGGAGGG 

AAACAGAAACCAAAGGAAAGAACAGC777AAGAAGCGC77AAGAGG7ACA 

AAG7AAAA7GA7GG7GC7AGAAAAG7AGC77C77AAAAAGAGCA7777CC 

AG7CTCACCC7GGAC7AAC7GAA7GAGAA7C7CAGGAG7G7GAGGCCCAG 

G7A7CCA7GG7C77AAAATGCCACCCACCAGG7GA77CCCAG7G7GCACC 

AGGGG7GAGAG7CACAGCC77AGGCCA7GCCAC7CAAAGGG7G7C77CAG 

ACCAGCAGCACCCACAGC7C7GGGAG7GCA7CAGAAAGACAGAGGC77GG 

CACCACCCACACC7AC7GAACCA7AG777GCAGG7GA777C77GCACA77 

AAAG7G7GGGAAA7GGAAAAGC7TAGAG77CAGC7AGC7CGG7GAC7C7C 

AG7CAACC7GCACC7GC7CCA7GAAC7CAGAC7GCC7GGGA7GGGCCCAG 

AAAAGC7CC7GAGGAGA77C7GA7G7AAGGCAGGGC7GA7AACCA7GGA7 

C7CA7C7GACCCCA7A7CAC7GGGGAG77AC77AGGA7C77GCC7GGGGC 

CAG7CA7C7C77CCA7AGACAC7GAGAG7G7CCACGA7GC77GGGGCAC7 

ACAGGG7GGGAGG7GGAGGA7CACGGG7GAG7CAGA7AGGAAGCC7GC7C 

C7GGGGAGC77ACAG7GCTA7AGGGCAGCAAGCCAAGGA7GCCAATACC7 

G7G7GCAGG7ACCAC7GACGAG7GCAGAGCGC7GCAGCACCAGAGAGGAA 

GCTACCC7G7GCAGAGGGGGCTGAGGAGGGC7GCAGGGAGATGACAGGAA 

AGCCGG7GT7ACAGGAGGAGTCC7CCCCAC7C777GGGCA7GAGGAGACC 

AGGAGGACA77CTACAG7GAGAAACCCAGGCAGAGGCCATGTGCTTA7GG 

CATGGGAAAAGAATGACACCrrAGACT7A77C7C7ACATTAGAATrGCC7 

ACCACAGA7ACCCA7A77ATAGC77CACA7AG7G7GG7GG77AC7G7G77 

77CA7A77G7CACA7T7GCCA7Tr7CCAGCCACCCACCCA77C77GACAG 

7CAC7GGCCCAGCC7GGGGGCCCC7G77C7TTA7CAAACAAG7GCC7GAG 

C7C777GCAGAGG7GAGGG7CACC7G7CCAATCAGAGGCCAGGAGGGAAC 

377C C C7777AAGACCC7AC7C7AGGCAGGCC7GGCCCAAA7GAG77GC7 

AGGAGCCCACGCCC7AAGAACCC7C7GAGCAC7G77G7GGC7GG7CC7GC 

7GC7AGAAG77G77CCTCCAGGGCCAGG7GCAAGAT77GTGGC7T77CAA 

AGGAGCCAC7AAAGC7CCAGC7CAGCCT7GCACGGTGC7GGGC7CC7GGG 

GGC77CC7GCC7CCAACCCTCCCAAC7C7TCCA7CACCGCTCCC77AGCC 

7GGCCAG7GCAGGGA7C7GT7CCACTC7AGGCAC7GC7GAGGGAA7GA7G 

CC7CCAG7CAGAGGG7GCAAAAAAGAGAGT7AAGAAAAACAATGA77A7A 

AAAAG7C C 7TT77 A TACGCCAGACATTTTCTTTGCTCAGGCTAAG7GCTA 

C77A777GAG7AAGCATTTTAGTTCTCATAAC7CC7CTC7CAAG7AGGTG 

C7GC7A77AC77TCA7TTCACAGA7GAGGACA77GAGGT77GGAGAGAC7 

7AG7AAC77G7CC7C7GTCC7ACAGCAGAGC7GGGA777GAA7C7A7CTG 

7CCAAA7C7GGAACCCAT7TGCTTGCACAGAAAGC77AA77GC77G7CCC 

AGCAAGA7AGAAAGCC7GGGAGTGGAAGAAA7ATTCAGTGGCTG7GATG7 

C7GAGCCCACAGGCAGGG7GGAGAGC7AGGGC7GGGGCCCTTGGACGTGG 

GGAAGAAAGGGC7GAGTCT7CCATTTTCAA7G7GAAG7G77GATA7CTGG 

TGA7A77GA7C7AGG7CCAAAGG7GAAGAAC77AAACCCGAAGAAA7TCA 

GCA77CA7GACCAGGA7CACAAAG7AC7GG7CCTGGAC7C7GGGA A7C7 C 

A7AGCAG77CCAGA7AAAAAC7ACA7ACGCCCAGG7GAC7C7CAG77T7G 

GC7G7G7777C7GCC7CCACC7AGCAGGGG7AAGGCC7CC7GC7AGG7GG 

GC7CAAC7CCA7GC7A7ACCA7GCCCCA7C7CCAGCAGG7GG7GGAAGCG 

AGGAGGAGAGGCCCCAGGGAC7AGGGCA7CAGA7GAAGGG7C7C7AGCAA 

7GACCAGA7C7GAAAG7AG7C777C7GGAAGGGC7GGAGAAAAAGAAGGA 

GGCAGACAC77AGAC7GGAAGAAGAGGAGGC77AAACCGG7G7GA7GGAG 
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GGAGAAGTGGACCACAGAvJTCAAGGGAGAGGGACTGTGCATCAGGCCTGA 

AACCCCAGCAGACAGGAGAGACCTTTCCCTGCTCTCAGAACCCACACATG 

TTCTGACTGTCTTTTTCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGA 

GCTCAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAA 

GGGGAGTTTTGTCTCTACTGTGACAAGGATAAAGGACAAAGTCATCCATC 

CCTTCAGCTGAAGGTGAGAGTTCTAGCTCAGTTTCCTGGGCCTTTGGCTA 

CGCCAAAGTAAAAGGCCAAGATCCTCAATGCCTCTCGCTTTCCTGCAAAT . 

TCTTATCTTGGCCAATATAACAGGGACATCCACCTTTCTGGAAGCACCAG 

GCAGAAGAGCCCCATAACTTCTTCTCTGGTTCCTTGCCCCTTCTAGGGAA 

GGAGGAGAGACTCCTCACAGCGGGGAGACAGCAAGGAGCTGAGCACCTGT 

TCTCCTCTCCTGGGCTCACTGGTCCTGGCCCTGGGCGGGTGGCGGTCCCC 

TCCTGCTGTGGCCCTCCATGTGGCAAGCAACACAATTGGGCCAGGACCCT 

GGCG7GCTGCTGTAGGGTAGGAGGGTGTGAGGGAGCACTCGGAGGGCAGT 

GTGTCTGCCCTGCAAATTTAGTCCTGGATGGAGCATCCTTTCACTTGAGG 

GGAGAAATCTTAGGAAGCTGAATTAGATACAGATCTAAGCCATATTCTCT 

AATT7TAAAAACTATAGAGCTGAGATTTTGGTATCCATCTGACTCTTACG 

TCTCTCTCTCTCTCTCTCTCTCTCTCAGTTTATTTTTAATCTGGGGGACA 

AGAAGGCCTGGAAAAGAGGGCATGATTGCTTATCATCCCTTAAATACCAG 

TACCAAGGCTGACACGTCATCTTTCCCAAGGACCATCTGCCTTCTCTCTT 

TTCCTCCTCTCCTGTGTAAAGGCCTGGAGGATGAGCACATGTGCTGTGTT 

TTCCTCCCTCTCAAAGCCTGTGCTATCTAATTAATCCCTTTTACCTCACA 

GAAGGAGAAACTGATGAAGCTGGCTGCCCAAAAGGAATCAGCACGCCGGC 

CCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGAACATGCTGGAGTCG 

GCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCC 

TGTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCAT 

TTCAACCAGTTTGCAAAGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAG 

GAAACTGCCCCATTGAACGCCTTCCTCGCTAATTTGAACTAATTGTATAA 

AAACACCAAACCTGCTCACTAAACTTTCTGTCATTGGGTTTCATTTCTCA 

TTCATGCTTTAAGGATTTGTGTTTTTAGGATATAGCAAGAAGCTTGTTTA 

ATTACAAAGTTCTGGGTTGGAAAGAGACCGGCTTCTGCTTGTGTACTGCT 

ACCCTGAACCATCAGACATGCATGTGTGTGTCATATGCTATGATGTGGCC 

AGTCTGAGTGCAATACTTGCAGCGGGAAGGAGCAGCTGGGTGCATGCTGT 

GCTCTAGAATTAGTCTTTCCTACTGGGGTTTGGTAGATTCTGAGGGCATT 

GATCCTGGGGCAGAAGTGGCTGAGTCTGTGTCTAGGGTACAGTGTGCAAG 

AAAGAAATGTAACAGCAAGTCACAATCCAGCCAAGTGATAGTGGAAAAGG 

GGTAGTTAGGTCCCAGATAAGGAGCAGGGTGACTTGACCTGTGGGAAAGG 

CACAGAGACAAGGAATCTGGGTCAGATGACAGCCAGGAGACCAGGTGAGG 

GAGGAGCCAGGTACTGTCTGGGAGGCTTGTCAACAAGGGCATGGTCCTAT 

CACTAAGCAGGGCTCAGATCCTCATAATGGGGGAGTGGAAGGCTGGCCGA 

ACAGAAATCAGGGCCTGGAAACAGAGTGAGGGGGTGGAGACAGGAGACTG 

AGGCTTGGAAATTAGTTTATTAGTTTTAGCTCTTCAGTTACAAGCAATAA 

TAATAGCTTCTAGCTTATTTAAGCAACAAGTATACTACAAAAGGAGCTTT 

CTAGAAGGATATTGGGTATATTCATTTCTTACTGCTGCTGTAACAAATTA 

CCACCAACTTAGTGGTTTAAACAATGCAATGTATTATCTTGCAGTTATGG 

AGGTCAGTCTGGAATGTGTCTCACTGGGCCAAAATCAAAGTATCAGCAGG 

ATAGCATTGCTTTGGGAGGCTCTAGGGGAGAGTCAATTTCCTTGCCTTTT 

CCAGCTTCCAGAG GCCAC CTGCATTCCTTGGCTAGTGGCCCACTCCCATC 

TTCGCTGCTTGGGTTTTTCTCACACTGCTTTGCTCTGACCCTCCTGCCTT 

CCTCTTTCACATATAAGAACGCTTGCAATTTACATCGGGCTCACGTCAAT 

ATCCAGGATACTCTCCCGTCTCAAAGAGGCTTAACTTTAATCACAGATGC 

AAAGTCCCTTTTGCTATGTCATGTAACATATACACAGGGTCTGGGGATTA 

GAATGTGGACATTTTCGGGGTGCCATTATTCTGCCTATCATGTGAAGTAA 

CTTTCAAAATGGAAAGACATGCTGAAGAAAAAGTCAGGGATTTCTGGCAG 

GCCAGAAATGACAGAAGGCAGAAAACGTTGGTCCCATCACTCAGATGGGT 

AAGAGCCAATCATG C TTTTTGTCAGTTAGCAAAAGATTGAGATTCCAAGC 

AAAGCATGCAACTGCCCTAGTTTGGGTCATGTGTCGACTCCTTGGTCAGT 

GAAGGGCAGCACACCTTGATCAATACTCCCTCCAAGACTGTATCCAACGA 

GGCCAG7GATGTTCCTCAAAGCAGAGCTAGAGAGCTAATCCCAGGAGAGA 

GGCGTGTGGGTGGTGGGCAGGAAGACAAAGCTCAGCCGTAAAGGAGTAGT 

AGGGACAGCACCCTAGGCATGGAGGCTCAAGTGAGATGATACCCATGGGA 
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AAAGCTCTGATAAGGTCKoCTCCTTCTGTTTCTGATCCTGATGGTGATGvi' 

TGATCAACACCAGCCCAGTGACAAAAAAGTACATAGTATATTTAGTAGAT 

GTTTCCCACACAGAGAAATGGTAAATATTCAAGGCGAGGAATACTCCAAA 

CATCC7ACCTTGATCATTACACATTCCGTGCATGTAATGAGTACTTGCAT 

GTATGCCATAAATATGTGAAAXATTATGTATCACTATATAAAAGAAAAAA 

AAATGTGGCCAGGTGACATCCATATTTTGGAGAGGAAGGCATGTCTTCTT 

CATAATATCACAAAACTATTTTCACAACAAAGACACAGCTGTTCAAATTA 

GTCTCTGAGCCGGGGCTGTCTCATGGCAGTGAGGACTCTGGTTCCCTTAC 

AGACTAGCAGAAAGGAGATGGGGCTTACTGACCATGGCCTTGAGGAGGCT 

GAACATGCAGGCCAAATGGAGACACAGACAGCCTGGGCTTGGTCCTGCTC 

CATCCCCTTCCAACCTGATGAGATATAGTGAGTCACTATGACGTGGGTCA 

CTCATGCTTCCTGTGAGGCTCCACCAAGACAGCAAGTGCATCAACACCTT 

ACGGAAGCACAAGGCCCTGTTTGTTGTTGACTTCATGAAAGGCATGGTTG 

TGGTGATCGCATTGAGTAGGCTTTTGGGTGAGAGGTGAAAAACCCCAACT 

ATCATGCATTGCAGCCCTCTGGTGGAAACTGTGCTTCAGGCTCTAAATTT 

CAGGCTCTAGACTGACTCCAGGATGAGTATTTGGAAGCTGAASTCAATCT 

GTGGTCTCTTCTCCTGTAGAGCAGGAGTCAGCACTTTTCATAGAGTGCCA 

GATTCTATATATCCTGCCACATGCTCTGTTGTTACAGAACAAAGAAGGCC 

ATAGACAGCATGGCTGTGTTGGCAAATACACAAAACAGGCAATAAGCTGT 

ATTTGGCCTTTAGGCTGCAGTTTGCCAACCCCTGCACTAACACAGAGCTT 

AAAGGTGGTGGTGGTGTGCTGGAGCTAGCTTATATCAGCTTGCAATAGCC 

AATTGCTAACATCTCTTCCAAACTCTGTGTCTGTGCCTTGATGTTGATAG 

TTTGAAATTGGCTACCCCATTTAATGCTGCAATCTTTTCTCACCCCAGCA 

CTACTGACTCCCCTTTGCCCTGTCTTATTTTTCTCACTCTAACATGCTGT 

ATAGTTTTCTTCTTACATTTATTGTTTGTGTCTTCCACTAGCATGTATGT 

CCCACAAGTTCTTTGCTCTGTGATGTATCCCAAGAACCCACTGCAGTGCT 

TGGCACTTGTAGGAACTCCATAAGATTTTTATAAATGAAGAAAGGAAGAA 

AAAAGAGAGGGAGGGAAAAAGGAAAGGAAGCCTTCTATTTAAATGATGGC 

CTTCTCCATATTTCTATAGTAATATGACTTCCCTTGCAAAGGGGGATGCA 

TTTTGGAAAATGTGTATAAATAAACTCAGGTGGTTTTGAATTTCATTTTC 

CTAACTGTAATTGTAATCATTGGTCTTTATGTTTAGTGAAAAAGTTTTGG 

CCCTTATGCCTCACACCTGAGAATCCCAAAGTATTGGTTTGTTAGAGCTC 

CCATAGAGAACCATAAACTGGGTGGCTTAAAACAACAGAAATGTATCGTC 

TCCTGGTTCAGGAGGCCAAAGTCTGAACTCCAGGTGTTGGTTCATTCTGA 

GAGCTCTGAGAGAGAATCTGTTCCAGGCTTCCCTTCAGTTTGTGGTAGCT 

CCAGGGTTCCTTGGCTGGTGGCAGCAAAACTCCAGTCTCTGCCCCCATCT 

TCACATGACTGTCTTCTCTCTGTGTTTCTGTGTCCAGATTGTCCTATAAG 

GACAGAGTCATACTGAATTAGGGCTCACTCGAATGACTTCATCTTAAGTT 

3AACTGTATCTGTAAAGACCTTATTTCCAAGTAAGGTCACATTCACAGCT 

ACTGGGGGATAGGACCTCAACATATCTTTTTGGGGGACATAATTCAACTC 

ATAATACCCAACATGATAACTGTTCATCCCATGAAATTTAATGTCTCTCA 

AAAGGTGATCTCAGGGCATTTAATCTGTGACAGAAACTCCCATAGGAAAC 

ATTCCAACCAGAAGCTCCTTTCACAGCTGGTCACTCCTCCTACCCCATCC 

GAGGTCCTGGGGCAGGGTGAGGCAGGTGGGGACAAGAAGAAGGCTGTCTC 

GGGTGTAGAAAGAGAAGACCCTTATTCACCCGGCACTCTGTTCATGAATG 

AGCTATCCAGCATAGGATATAATAAATCGCTTTAGGAGTGGTAGACTCCA 

AACATTTTTTTGGTCCCAGTTATCCTAATCAATTAAACAAACTCTAGAAC 

C CATCTTGAAGTGCAGGCATTGGGACATTATGAAACTTACACAGAATTCA 

AAAATTTACAAGGGCTAAATAAAACAGGGTCTGACATCTAATATTTTCTT 

CCCACATTCCCATGCACTGTCTGGCTCAACCATCCCCAACCCTCACTCTC 

ATCCTGGTGGACACATGCCTAGTGATGTGATCAGCTGGTTCACAGGGGGC 

TGGTGATGGTGGATATACAGCTTTTGCCAATTTCCATGGCATAACTACTC 

CAAATATGGCCAATTTCAAACTACCAACATGAAGGCACAGACACAGAGTT 

TGGAAGAGATGTTAGCAATTGGCTATTGCAAGCTGATATAAGCTAGCTCC 

AGCACAGCACCACCGCTACCTTTAAGCTCCTTGTGTTAGTGCAAGGGTTG 

GCAAACTGCAGCCTAAAGGCCAAATACAGCTTACTGCCTGTTTTGTGTAT 

TTGCCAACACAGCCATGCTGTCTATGGCCTTCTTTGTTCTGTAACAACAG 

AGCATGTGGCAGGATATATAGAATCTGGCAGTCTTTAATAAGTGCTGACT 

CCTGCTCTACAGGAGAACACAGATTGTCTTCAGCTTCCAAACATTCATCT 

CTGAGTCAG7CTAGAGCCTGAAATTTAGACTGAAGCACAGTTTCCACCAG 
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AGGGCTGCAATGCATGA'iAGTTGGGGTTTTCACCTCTCACCCAAAAGCCT 

ACTCAATTTTTTACTGCAAAAACATG7TATCATCATTATTTTTTACTTAG 

CCCACCTTTCCTTGGCAATTTTCCATAGGAAAATGCATTCTAAATTTCAA 

CTAATCAGGGGACTTGGAGCCTCTGGACACCCCCTTGTTCCTTGCCCACA 

GTCCCT7GCAGAAGGTGCCTTATCAGAGCGGCTCCATGCAGGGGCTCAGG 

ACAGGAT CAGATGTCAGTTGCAC CAAGGGGGCAGGGACAGATCCTCTCTG 

CTEACCATGCAGAAGGGACTGTTCAGTGCACCGTCATGGTCCTGGTGATT 

T CTGG T C C ATAAGGGAATTTTCACATG CATCGGGTGATTGTCACATCAGC 

ACAACACTGTGAGGAAGGCAGAGTGAGAATTTGTGTGCCCATTTTATAGG 

TGAGAAAACAGATGCAGAGACATTAAGTAACTTCACCACAGTCATGCGGG 

TTTTAAGTGGCAGACTTTCAGGTGT7GTGACTCCTAGTCCAGAGTTCTTT 

GCACTGCCCCTGAGGTGCTAAAACTCTACTGTGCTTTAAGACTCACTTGG 

GGAGCTTCCTAAAAAGAGAGATTGCACAACCTGAGATTCTTGTTTAACTG 

TTTTGGGATGTAGCTCAGGGATCTAGCTGCCTTAAAAAAAAAAACTCCCA 

AGTAATTCTGATGCAAGCGGTTCTTTTTTGTCCACCTTTGAAGAAACACT 

GCCTC CTCCCCATACATTTCATTAGAAAATGGTAACATGTTTTTCAGCCT 

GAGAGCCATTTCTGGGTGACCGGACGTCGGCAGCCCGCTGTACTAGCTTT 

CAGTCTAGGCTTAAACACACATGATAGGAGATGTCCTACTCCAGATGATA 

TGAGTCTGAACCATGGAAAAATTCCATTGTGTGGCACATCTGGTGGGTGT 

GCACTGTCCCCAGCAGTGAGGCACCCAGTGAAGACAGCAGCTGGGAGAGG 

CTTAGTTACATGCAGTGGGACAGTGTGGGCTAGACTGCTGAGCCCtCTGC 

AGTTTACTCTGTGTCAGGCAATGAGGGTGAAAGGCTGATCAGACCCACGT 

GCAGAC C AT AC C CT C CAGGGAGACAGAT AT CAGT CAGGACAAC C C CAAGT 

GTAGCTGGAGAAGCAGTGCCCAGGTATGACCGGATGTGTATCCAACCAGG 

AAATCTGCATATAAATATAAGAGGAGA AAAT GAACAGATGTTGCTCTTAT 

ATGTAGATATTTATGAAGAGCATATAATTTTGTTTTGTGTGTTTTAAGAA 

GTTTATAAGTATGCCTTAAAAATGTATAGTATATACTGTAGGTATTTTTT 

CCATTAGATATTTTGTTTTTCATACTTATCCACATTGACATTGTAGCAAC 

AGTATAATATAACAACCTCCTCTACAAAAGCAGAAGGAAGTGAAGCTTTG 

GAAGGAAGCACCCAGTGAGCTTGCCCCTTTCAGGTGGGTGCAGTGAGCAG 

GAGTCAGTGAGGTTGAGATCCTTTGAGAGGAGGCAATCATTAACCAGGAA 

ATCTGCACTGCATCCTGGCCACACCTAACCCTTGGACAATGGTGCTTGGA 

GCGCCTTCCAGCTCTTAAGGCTTGCGATTTCTTTCTCTCACTCTTCACCC 

ACGATGATTAAATCTTCTCCTACAGAGTTGGACAATAAAGCCTTGAGTTC 

CTGCCTCCCC7GGTGTGATCACGAGGCATAGACATGGCCAGGAACATGTA 

GGTGTCTTTGAAAGCTGAACAAGTTAGTAAATTTCAAACCTCATTTCACC 

CACCAGTAAAATGGGAATAATAATAAACCTATTTTACATAGGGTTGACAA 

GAGGAGTAAAGAGGGATTCAATGAAAGTTCGTTATTATCATTTGTAGTAG 

CAGTGTTGATAATATCAACTGAAAGTTCATTATCATTATTAGTAGCAGTA 

TTGATAACCCTCTTTTCTGTGCCTTCTCACTGGTGGGCCCAGGCCATCAG 

CAATGCCCAGGGTGTCATGGATCTCTGCTGCATCGGGCACCAGCTGTGTC 

AATGGTGAGAACAGTACAAGGGTGGGCAGGGCAAGGCAGGAAGCACCCAG 

GAGCAGCAGCTTCATGGGGTGAAGATGTCAGGAGCTTAGGGACAGTCAGA 

GCGGGTGTGCCTCCTCTTGTGGAGCCTTTCTGCGTGGGTAGGAACTGCTG 

CAGCTGTGGCCATGGATTCACCTGAATATGGGTGGAATTAGGCATTCAGC 

TGGGTTAGCTGTGCCTAGAAGGAGGAACTCTAAACTGAGAACTTGTCCCT 

ATTGCCACCTCTGATAGGCAGATGATCCATCCATCAGTGGCTGAGCTGAG 

GTGTGCATGGGGATGGGTAAGAGCCCACACACAGGGCTGATGACTGAGTC 

TATTTAGAACAATAGATGTAAAATCTGATAATGTAAAATGTGATAGATTA 

TTTTGTCAATTAGAAATGGTACCATATAATTATATATATACATAAACATG 

TATACATATACACACATATACATGTGTGTATAAACACACACAGTATTGTC 

CCCTACTCATTCCATAAACCTGATGCCTTTAGCTGGGATTCCCAGCTTTC 

ACTCTCCTCTCTGTCATCTGCTGTCTATATCCTCCCCATCCTGTAATTCT 

GGCTTATATGCCACTTCCTCCCTAAAGCCCTCCCTCAATCCCTTGCTGGA 

AGTGACATTTTCCTCTTTGAGCTGCCCCTGCTTGTGCTTTGGTGAGGTCA 

GCTGTATTGCAGTACCTTGTATTGTGGTTGTCACATCATCGTATAGAATT 

AATTTCTGACACATTCCGTATTTTTCAAAGGGCCTAGTGTGGGGCTTTAA 

CAGTAACTACGCCACCACGCCCAGTTAATTTTTTGTATTTTTGGTGGAGA 

CAAGGTTTCACCATGTTGGCCGGGCTGGTTTCGAACTCCTGACTTCAGGT 

GATCTGTCTGCCTCAGCCTCCTGGAGTGCTAGGATTGCAGGCATGAGCCA 
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CTGCACCCAGCCACCTATCAAAATTTTAAGTGCCATTTTTATTTTTTATT 
TTTTGTAGAAATGGACAAGCTGATCGCAAAATTCACATGGAATTGCAGGA 



ATCAGAACACTATGGTCCTGGCATAAGTGATGCTGGACAGGTGAGCCCCA 



ACAGCTCCAGCCCTGTTACAGCTCCAGCCCTGTTACAACTCTGACTACTC 
CTGCACAGAAGGGCTACCCTGTAGGCAGAGAGTAGCAACTCAGGGCAGTT 
TTGCAGTCATTrATATCCACTTTTAACACATGCAGATTAAGGGACAATTT 
ATGCAGAAATTTCTACGGAATTGGTAATAACTTTTGGGTCATGGAGTCAT 
CATGGAAGGGGGGCGGGGAACTCCCTGGTGTTGCCATGATGACGGTAAAC 
TGATATGGCGAACTGGTGGGTATGTCACATGAAAAGCTCCTTCCACCCCA 
GCCCTGTTTCAATTAGTCCTCGGTTTGGTCCAGTGTCCAAGTCCTGCCTC 


CTTT7ATGCTGGGTGCACACCTGTAATCCCAOtAU i i i^uuv. 
CAGGAGGATCTCTTGAGCCCAAGAAGCTGAGGCTACAGTGAGCCGAAATT 
GTGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCGAATA 
AATAAATAAACAAATATATAATTATAGATCTCTGGATCTTGCCTTCGGAG 
ACTGACTCAACTAACTGGTCTGGGTGGGAGCCCAGCCATTrGTATTTTTrt 
GAAAACTCTCCAAATGATTTTACTGTGCAGCQIAGGTTGAGAATCACTGT 


GAAAACTCTCCAAATGATTTTACTGTGCAGCUWjGXi uauak x w\v. a « * 
ATCATAGGGTTGGACTCCTAACTGGAAACAGTrTGCACCATCAGGTCTCG 
CAGCATTCTGATAATAGTrAAGCTTTCCTCCTAGATTTT^^AGA 
TGAGTCATGTTrACAAGTTTTTACCAAGAGACAAACTATCTTTCTGCC^CT 
TACTTTCTCTCTTATACTATTCTAATCCaWaAACCCTTTGGAACCTCCAC 
TGAGAGATGAATCTAGAAAGTGACTCTCTTGGCTACAACAGAGAGTAATG 
TTGGCCTGTTTGTGCCAGATCCAGTTGGTGCTGGTGGTGGGACAGCA 
CCCTGAAATCCCCTCCTCTCCCGTCAGATTCAGTCCCCCAT^CATCAC 
GTACAATCATCACTATGGGTTTCTATTACCTTGCrAGGGCATTTGGAGGT 
ACCA^TATACCAACTATTAGTrTTGAGCCATGGTTCCCAAAGTGTGGAC 
G^G^GCACCTCAGCACACTCACGAGGTGTCATGGGATATTTAAAT^ 

c?gaagWcacagtgacatctgtcaggcccgtg^ 

AAATTGTCTCAACCCAATTGCTTAAGAAGCAGAACTGGCCAGGCACGGTG 

gctScatctgtaatcccagcactttgggaggccgaggcgggcagatcac 

SSawSSaaaaj^ 
Sgcta^ggaggctga^^ 

tgagactacatctcaaaaaaaaaaaaatgagacsac»gagaga^ 

GACACGAAGGATACCATGAACTGAGAGATTTrGGGAACCTCTCCT^AGA 

ag^gga^gcattccttggg^ 

CCTTTTGATGGTGCATTTAGGTGTGGCAAGACAGCTCTTAGA^ 
G^TGTCCTTGGAGACAGAGGGAACTAC^TTGAGCTGCCCGATAAAGAC 
A^CAGCCTG^GAGTGTAGTGACTCATGTCTGTAATCCTAGTGCm 
GGGAGGCTGAAGTGGGAGGATTGCTTGAGGCCAGGGGTTTGAGATCAGCC 

?Sg^Sa^agacctctacaaaaaaawagaa^ 


AGGTCCTGGGAATAGGTAGAAGCCAAGGGAAAT^v*a>^av««j^ 
CCTGGAGCTGGAACTGGGGGAGCAGGATGGCCTCTGAGAAGTTCCTGATA 
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GTGGTGTCACTGATGTGTCTGATGTTTAGTTGTAATTATTTGCTGGGCCC 

CTGTCATCCCTCATATCTGATAGCTCTTTGCTAGTCAAAGTGTGGTCTGG 

GGATCAGCGGCATCAGCATCACTTGAGAACTTGTTAGAGATGCAGAATCT 

AGAG CC C CACC CGGGAC CCAGAAACAGAGCCTGCATTTTAACAAGCTCCC 

CAGGTGATTCTCACACACACTCGCATTTGAGAAGCACTGGGCTAGTTGAC 

AGATT CT CAGGCATGGCTGACATTGAAATATCCAGGGAGCAGGCTTGGCA 

TTAGGATGTTTAAAAGTCCTCCAGGTGTTTCTAAAGCCAGGTTTGAGGAA 

7TACTGGGCTGATACAAATGTTTTGTGATGATGCTTTGTGTGTGTGTGTG 

7GTGTG7GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGGAATTC 

TGGGTCACTTGGCACCAACACAGGAAACAATGGAAATATGTGAGCCATGA 

CAGAAAGGTCAGGAGATAAAAGAAATTAGTGACATGAGAGGTACTCCTCA 

GGTGTTAGGAAAGAGGGTAGAGCAAACCAGGTTTTCCACCATATGTTGGA 

TAGGGGGT CAAGTAAATTTCTACTTAAAAATTACAAACAGGGGCTGGGCG 

CGGTGGCTCATGCCTGTAATCCCGCACTTTGGGAGGCTGAGGAGGGCGGA 

TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACACGGTGAAACCGTG 

TCTCCACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGCGTGCCTTTA 

TTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAG 

GTGGAGGTTGCAGTGGGCCGAGATCGCACCACTGCAATCCAGAGCGAGAC 

TGTGTCAAAAAAAAAAAAAAAAAGAAAATTCCAAACAGGATGACCCTAAG 

CCTGCAGGACTTGGAGACATCTAGGTGACTGATACTCAGTCACAAAACAT 

AATTGGTCACAGGCCTGATGAAATGCACAGCAGACCTTCAGATGGTATGC 

ACTCAAGTGATATCCACAAGTCCACCTAAAGAAATGCTATATTCAGACAT 

TTGGCATCAATCTCTATCAAACAAAGATAGTCCAAAGCAATGGGTTCCAA 

AAACACTTTCCTAAGACAAATTCTCTATTTGCTTTTAATATCAGTCATCC 

CAGCCCTTGGAATAGAGGAGCAAATGATACCAGTGGTACCCTACCACAAT 

GCACCAAGGTATTATACTCTCATGCTCCATTTTCTCCCTCTGTCTACATC 

ACTAATAACTCATTGATTTCTGGTGCAAGCCCTGCTGGGAGAAAAAGTCT 

ACTCTTGTAC CTTGGAGCAAGTTGCTCAGAGTAGGTATCGAGGATAAAAT 

TTGGAAAGTTAGAAAAGCTATTAGAAGGAGATCCTAGTAGTTGAAAACAC 

AGCCTGGCCAAGTCAATGATGCTATTTCATCTCCCCAGCCTTGCATGTCC 

ATAGCTAAGGAAGACAATTTAGGCTTGGGCTAGAGGATGGGAAAGGGCAA 

AATTACTGATGCCACAGCCCAGAGAGGTATTCTAGTAATCTGAGGGTGAG 

GACCACATACCTGGTTCAGGGACGTACAGTGTTGACAGCTGTGAGTGGAT 

GCCTGGAGTTCTGGCGTGTCTTCTAGCACAATGATACCTGAGACTCTTGC 

ATCATTGGGAATAATAAAATGGGAGTGGATAGATATGAAATTATGATGGC 

AATAAGCAATCAGCTAATAGCTTCATTGATGGGACAGATTAAAGATGGCT 

GCAAATCCTTTGGTCCAGGTTTGGGATATAGGCAGCATTTGTATTGGAAT 

GCTGATAGTCTGAGGCCATGAAAAGTCCACCTGCAGTAGTGGTAGGAGGA 

ACAAGCCTCACTTTCTTCAATGTGTGTGACTGCTGTCTTGATTCCCTGGG 

TGGCCAGTTCCATTCGTGTGGTTCTTTGGTCCACTTGACTCTGGGGTGGC 

TCTGTGATGGCTTGACCAATACAATGTAGTGGAAATGATGCTGTCATCAT 

TTCCAGCCTCTTCCAGCCTTAAGGAACTGGCAACTTTTATTTCTGTCCCT 

TGGAATACTTGTTCTTGCAACCCATCCATCATACAGTGAGAAATTCTAAG 

CTGCCCCATTAAGAGGCCCACATGGTGATAAATTGGGGTCTTACATACAG 

CCCTAGCTGTGCTCCTAGCTGACAAACAGTAGCAACTTGTCACCAGGCGA 

GTGAACCACTTAGGACTGTATACTCCAGCCCCAGTTGAGCAATGTGGAAC 

AGAGTAAACCATCTCAGCTTAGCCCTGCCCAAACTGCAGAATTATGAGCA 

AAATAATCCCCTAGGCTTTGGGCTGATTTGTTCCAGATTACTGGAACAGA 

ATTTGGTACCAGGGGTGAGGTGCTACAGCAATGAAAGCTTAAGACACGTG 

ACTTTGGTTTTGGGTCTGAGTGGCAGGGGAACTTGGCAGGCCTCAAGGAA 

ACTTTTAGGGAGGGTTGAAGCATAGTGAGGAAAACAGTAGGGGAAGCTAG 

AGGAAAAAATGATGCTTGGTATGTAGTGGTGGGAAGTTTAGCAAAACTCG 

CCTGATGTAATGTGGGAAATTGTAAGAACTCAGAACGATTTAAGGGCATG 

TTTTATAGGTCCTTTAAGAAACTTCTAGGCCAGGCGCAGTGGCTCATGTC 

TGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTCAGG 

AGATCGAGACAATCCTGGCTAACATTGTGAAACCCCGTCTCTACTAAAAC 

TACAAAAAAAAATTAGCCGGGCATGGTGGCGGGTGCCTGTAGTCCCAGCT 

ACTAGGGAGGCTGAGGCAGAAGAATGGCGTGAACCTGGGATGTGGATCTT 

GAAGTGAGCCCAGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGTGA 

GACT C CGTCTCAAAC CGAAAAAAAAAAAAAAAAAAAGAAACTTCTAGGGC 
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TGGTCCCGTGGAAGCCTCACACATGGTACACAAAGGCTGTCTTGAAAAGA 

AACGTAAGTGTGTTTTTTGGTTTAATAAAATTGATTATAAATGGATAATG 

CAAAACATTTTAAAGAATTTTACTAGCTTACATTAGCAGATTTGGATCCA 

GTGATTGTTACATTCTGGTACTGAGCCCCTGAATTACTTCTTTGAGTAAG 

GCATTATACCAAAGCTATTGATAGTTGGGCTTATAGGGTGTATGTTTGAA 

GAACTACTAATGTCAAAACCAATATTTCACGGTCGACAAGAGGACATCAG 

AACTGGTAATCCTTATTACCATGACTGGCTGGACAGAATACTCAATGTAA 

TSGGAT'TTCCTGCAAATAAAGACGGGGAAGATGTAAAAAAGATGCCTGAA 

CATTCAACATTAATGAAAGATTTCAGAAGAAATAXGTATACTAACTGCAG 

CCTTATCAAGTATATGGAAAAACACAAAGTTAAACCAGATAGTAAAGCAT 

TCCACTTGCTTCAGAAGTTTCTTACTATGGACCCAATAAAGTGAATTACC 

TGAGAACGGGGTCCCTGTTTCTTCGAAGACCCACTTCCTACATCAGACGT 

TTTCAACAGTTGTCAAATCCCCTACCCAAAATGAGAATTTTTAACAGAAG 

AAGAACCTGATGACAAAGGAGCCAAAAAGAACCACCACCGGCAGCAGGGC 

CATAACCACACGAATGGAACTGGCCACCCAGGAATCAAGACAACGGTCAC 

ACACAGGGACCCCCGTTGAAGAAAGTGAGGCTTGTTCCTCCTACCACTAC 

CTCAGGTGGACTTTTCACGGCCTCAGACTATCCGCGTTCCAATCCACATG 

CTGCCTATATCCCAACCCTGGACCAAGCACATCCCAGCCGAAGAGCAGTG 

TAGGATACTCAGCTACCTCCCAGCAGGCTCCACAGGACCCACGTCAGACA 

CACGGGTACTGAGCTGCATCGGAATCTTGTCCGTGCACTGTTGTGAATGC 

TGCAGGGCTGACTGTGCAGCTCTCCGTGGGAACCTGGTATGGGCCATGAG 

AATGTACTGTACAACCACACCTGCCCAGTAGCCAAGTTCCTTCCACCGCT 

TTTCACAGATCGGGGTAGTGGCTTCCAGTTTGTACCTATTTTGGAGTTAG 

ACCTGAAAAGAAAGCGCTAGCACAGTTTGTGTTGTGGATTTGCTACTTTC 

ATAGTTAACTTGACCTGGCTCAGACTGACCAGTACTTTTTTTTCCGTGAC 

AGTCTATAGCAGTTGAAGCTGAGAATGTGCTAGGGGCAAGCGTTTGTCTT 

CATATGTCATGAATTCCTCCAGTGTAACAACATTATCTGACCAATAGTAC 

ACACACAGACACAAGGTTTAACTGGTACTTGAAAACATACAGTAGGTGTT 

AACTCAGTGAAATAACCAGGACTCAAAGTAAG ATTAT TTTGGTACACCTT 

TCTTGTTAGTGTCTTATCAGTGAGTTGATTCATTTTCTACATTAATCAGT 

GTTTTCTGACCAAGAATATTGCTTGGATTTTTCTGAAAGTACAAAAAGCC 

ACATAGTTTTTTTCAGAAAGGTTTCAAAACTCCTAAAGATTAATTTCCAA 

GTATAAGTrTGTTTTTATTTTCAATCTATGACTTGACTGGTAT TAAAG CT 

GCTATTTGATAGTAATTAGATATATTCTCATTGATATAAAC CTGTTT GGT 

TCAGCAAACAAACTAAAATGATTGTCACAGACAATGCTTTATTTTTCCTG 

TTGGTGTTGCTTGTGGGAAAAAGAAAGAGAGA TCAG ATTGTTACTGTGTC 

TGTGTAGAAAGAAGTAGACATAGGAGACTCCATTTTGTTCTGTACTAAGA 

AAAATTCTTCTGCCTTGAGATGCTGTTAATCTATATAACCTTACCCCCAA 

C CCTGTGCTCTCTGAAACATGTGCTGTGTCCACTCAGGGTTAAATGGATT 

AAGGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATG 

CTCGTAAGAGTCATCACCACTCCCTAATCTCAAGTACCCAGGGACACAAA 

CACTGC7GAAGGCCGCAGGGACCTCTGCCTAGGAAAGCCAGGTATTGTCC 

AAGGTTTCTCCCCATGTGATAGTCTGAAATATGGCCTCGTGGGAGGGGAA 

AGACCTGACCGTCCCCCAGCCCGACACCCGTAAAGGGTCTGTGCTGAGGA 

GGATTAGTATACGAGGAAGGAACGCCTCTTTGCAGTTGAGACAAGAGGAA 

GGCATCTGTCTTCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAA 

CCCGATTTTATGTTCCATCTACTGAGATAGGGGAAAACCACCTTAGGGCT 

GGAGGTGGGACATGCGGCAGCAATACTGCTCTTTA AGACA TTGAGATGTT 

TATGTGTATGCATATCTAAAGCACAGCACTTAATTCTTTACCTTGTCTAT 

GTTGCAGAGACCTTTGTTCACGTGTTTATCTGCTGACCTTCTCTCCACTA 

TTATCCTATGACCCTGCCACATCCCCCTCTCCGAGAAACACCCAAGAATG 

ATCAATAAATACTAAGGGAACTCAGAGGCCGGCGGGATCCTCCATATACT 

GAACGCTTGTCCCCTGGGCCCCCTTATTTCTTTCTCTATACTTGGTCTCT 

GTGTCTTTTTCTTTTCCAAGTCTCTCGTTCCACCTAATGAGAAACACCCA 

CAGGTGTAAAGGGGCAACCCACCCCTTCATTGCTGATTTGTGAGCGTGCT 

T TAAGGTGAAAAAAGCATGAATGTTAACTTCCTTAAAAAGGTACAGCATC 

CAATTCAAATATTTTTGTCCTGATTTTAATGCTAGTTGATGTAGTGCTAT 

TAAAATTTTGTTCAACATGGACACAGAGAGGGGAACAACACATACCAGGG 

C^TGTTGCGGGGTGGGGATGAGGGGAGGGAACTTAGAGGACAGGTGAACA 

GGTGCAGCAGATCACCATGGCCCACATATACCTATTTAACAAACCTGCAC 
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GTTCTGCACACGTATCCCATTTCTTTTTTTTTTTAAGAAATAGAAAAAAA 
AATAAAATTTTGTTCACTGATTCTTCCATTTTAAAACTTGTTTGCATGTG 
GTTTAGGATGCCCTTACTTCAGCAAAGGAGAAGGAATAGGAGGGCCTTAG 
AATTTTTGAGGGAAAAAAACCCTATAACATACATTGTACTGTATCAAACr 
ATTTTACATGAATGACACAAGTATTCTGAATAAAAAAATAATTGAACATT 
GTTAAGAACAAGGTGTCATGTAATTrATTTTTCATAAATAAAAAAATTAT 
AGrGGCTTAGACTGAAAGGAACAGAGAATTTAAAAAATTAAAAAGAAGCC 
TTAGTATATTTTTGTATATAGTTTCCATGTGCCATATTTGCCATAATTGG 
ATGAGAATTTTTTGACCTCTGGCAGGGTGACCCTATATTTTCANTNTATA 
AAGCGTGCATCATACC 
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